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HUMAN CALCIUM CHANNEL COMPOSITIONS AND METHODS USING THEM 

TECHNICAL FIELD 

The present invention relates to molecular biology and 
pharmacology. More particularly, the invention relates to 
calcium channel compositions and methods of making and using 
the same. 

BACKGROUND OF THE INVENTION 

Calcium channels are membrane -spanning, multi-subunit 
proteins that allow controlled entry of Ca 2 * ions into cells 
from the extracellular fluid. Cells throughout the animal 
kingdom, and at least some bacterial, fungal and plant cells, 
possess one or more types of calcium channel. 

The most common type of calcium channel is voltage 
dependent. "Opening" of a voltage -dependent channel to allow 
an influx of Ca 2 * ions into the cells requires a depolarization 
to a certain level of the potential difference between the 
inside of the cell bearing the channel and the extracellular 
medium bathing the cell. The rate of influx of Ca 2 * into the 
cell depends on this potential difference. All "excitable" 
cells in animals, such as neurons of the central nervous 
system (CNS) , peripheral nerve cells and muscle cells, 
including those of skeletal muscles, cardiac muscles, and 
venous and arterial smooth muscles, have voltage -dependent 
calcium channels. 

Multiple types of calcium channels have been identified 
in mammalian cells from various tissues, including skeletal 
muscle, cardiac muscle, lung, smooth muscle and brain, [see, 
e.g., Bean, B. P. (1989) Ann. J?ev. Physiol. 52:367-384 and Hess, 
P. (1990) Ann. Rev. Neurosci . 55:337] . The different types of 
calcium channels have been broadly categorized into four 
classes, L- , T-, N- , and P-type, distinguished by current 
kinetics, holding potential sensitivity and sensitivity to 
calcium channel agonists and antagonists. 

Calcium channels are multisubunit proteins that contain 
two large subunits, designated or l and ot 2 , which have molecular 
weights between about 130 and about 200 kilodaltons ("kD"), 
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and one to three different smaller subunits of less than about 
60 kD in molecular weight. At least one of the larger 
subunits and possibly some of the smaller subunits are 
glycosylated. Some of the subunits are capable of being 
phosphorylated. The ci x subunit has a molecular weight of 
about 150 to about 170 kD when analyzed by sodium 
dodecylsulfate (SDS) -polyacryl amide gel electrophoresis (PAGE) 
after isolation from mammalian muscle tissue and has specific 
binding sites for various 1, 4-dihydropyridines (DHPs) and 
phenylalkylamines. Under non-reducing conditions (in the 
presence of N-ethylmaleimide) , the o/ 2 subunit migrates in 
SDS - PAGE as a band corresponding to a molecular weight of 
about 160-190 kD. Upon reduction, a large fragment and 
smaller fragments are released. The 0 subunit of the rabbit 
skeletal muscle calcium channel is a phosphorylated protein 
that has a molecular weight of 52-65 kD as determined by SDS- 
PAGE analysis. This subunit is insensitive to reducing 
conditions. The y subunit of the calcium channel, which is 
not observed in all purified preparations, appears to be a 
glycoprotein with an apparent molecular weight of 30-33 kD. as 
determined by SDS-PAGE analysis. 

In order to study calcium channel structure and function, 
large amounts of pure channel protein are needed. Because of 
the complex nature of these multisubunit proteins, the varying 
concentrations of calcium channels in tissue sources of the 
protein, the presence of mixed populations of calcium channels 
in tissues, difficulties in obtaining tissues of interest, and 
the modifications of the native protein that can occur during 
the isolation procedure, it is extremely difficult to obtain 
large amounts of highly purified, completely intact calcium 
channel protein. 

Characterization of a particular type of calcium channel 
by analysis of whole cells is severely restricted by the 
presence of mixed populations of different types of calcium 
channels in the majority of cells. Single-channel recording 
methods that are used to examine individual calcium channels 
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do not reveal any information regarding the molecular 
structure or biochemical composition of the channel . 
Furthermore, in performing this type of analysis, the channel 
is isolated from other cellular constituents that might be 
important for natural functions and pharmacological 
interactions . 

Characterization of the gene or genes encoding Calcium 
channels provides another means of characterization of 
different types of calcium channels. The amino acid sequence 
determined from a complete nucleotide sequence of the coding 
region of a gene encoding a calcium channel protein represents 
the primary structure of the protein. Furthermore, secondary 
structure of the calcium channel protein and the relationship 
of the protein to the membrane may be predicted based on 
analysis of the primary structure. For instance, hydropathy 
plots of the a 1 subunit protein of the rabbit skeletal muscle 
calcium channel indicate that it contains four internal 
repeats, each containing six putative transmembrane regions 
[Tanabe, T. et al . (1987) Mature 326:313]. 

Because calcium channels are present in various tissues 
and have a central role in regulating intracellular calcium 
ion concentrations, they are implicated in a number of vital 
processes in animals, including neurotransmitter release, 
muscle contraction, pacemaker activity, and secretion of 
hormones and other substances. These processes appear to be 
involved in numerous human disorders, such as CNS and 
cardiovascular diseases. Calcium channels, thus, are also 
implicated in numerous disorders. A number of compounds 
useful for treating various cardiovascular diseases in 
animals, including humans, are thought to exert their 
beneficial effects by modulating functions of voltage- 
dependent calcium channels present in cardiac and/or vascular 
smooth muscle. Many of these compounds bind to calcium 
channels and block, or reduce the rate of, influx of Ca 2+ into 
the cells in response to depolarization of the cell membrane. 
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The results of studies of recombinant expression of 
rabbit calcium channel subunit -encoding cDNA clones and 
transcripts of the cDNA clones indicate that the a 1 subunit 
forms the pore through which calcium enters cells. The 
relevance of the barium currents generated in these 
recombinant cells to the actual current generated by calcium 
channels containing as one component the respective a x 
subunits in vivo is unclear. In order to completely and 
accurately characterize and evaluate different calcium channel 
types, however, it is essential to examine the functional 
properties of recombinant channels containing all of the 
subunits as found in vivo. 

In order to conduct this examination and to fully 
understand calcium channel structure and function, it is 
critical to identify and characterize as many calcium channel 
subunits as possible. Also in order to prepare recombinant 
cells for use in identifying compounds that interact with 
calcium channels, it is necessary to be able to produce cells 
that express uniform populations of calcium channels 
containing defined subunits. 

An understanding of the pharmacology of compounds that 
interact with calcium channels in other organ systems, such as 
the CNS, may aid in the rational design of compounds that 
specifically interact with subtypes of human calcium channels 
to have desired therapeutic effects, such as in the treatment 
of neurodegenerative and cardiovascular disorders . Such 
understanding and the ability to rationally design 
therapeutically effective compounds, however, have been 
hampered by an inability to independently determine the types 
of human calcium channels and the molecular nature of 
individual subtypes, particularly in the CNS, and by the 
unavailability of pure preparations of specific channel 
subtypes to use for evaluation of the specificity of calcium 
channel-ef f ecting compounds. Thus, identification of DNA 
encoding human calcium channel subunits and the use of such 
DNA for expression of calcium channel subunits and functional 
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calcium channels would aid in screening and designing 
therapeutically effective compounds. 

Therefore, it is an object herein, to provide DNA 
encoding specific calcium channel subunits and to provide 
eukaryotic cells bearing recombinant tissue-specific or 
subtype- specific calcium channels. It is also an object to 
provide assays for identification of potentially therapeutic 
compounds that act as calcium channel antagonists and 
agonists . 
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SUMMARY OF THE INVENTION 

Isolated and purified nucleic acid fragments that encode 
human calcium channel subunits are provided. DNA encoding a, 
subunits of a human calcium channel, and RNA, encoding such 
subunits, made upon transcription of such DNA are provided. 
In particular, DNA fragments encoding ^ subunits of voltage- 
dependent human calcium channels (VDCCs) type A, type B (also 
referred to as VDCC IV), type C (also referred to as VDCC II) 
type D (also referred to as VDCC III) and type E are provided. 

DNA encoding at 1A , a 1B , a ic , a iv and a is subunits is provided. 
DNA encoding an a 1D subunit that includes the amino acids 
substantially as set forth as residues 10-2161 of SEQ ID No. 
1 is provided. DNA encoding an a lt> subunit that includes 
substantially the amino acids set forth as amino acids 1-34 in 
SEQ ID No. 2 in place of amino acids 373-406 of SEQ ID No. l 
is also provided. DNA encoding an o lc subunit that includes 
the amino acids substantially as set forth in SEQ ID No. 3 or 
SEQ ID No. 6 and DNA encoding an a 1B subunit that includes an 
amino acid sequence substantially as set forth in SEQ ID No. 
7 or in SEQ ID No. 8 is also provided. 

DNA encoding a lA subunits is also provided. Such DNA 
includes DNA encoding an or 1A subunit that has substantially the 
same sequence of amino acids as encoded by the DNA set forth 
in SEQ ID No. 22 or No. 23 or other splice variants of a 1A that 
include all or part of the sequence set forth in SEQ ID No. 22 
or 23. The sequence set forth in SEQ ID NO. 22 is a splice 
variant designated a 1A . lf - and the sequence set forth in SEQ ID 
NO. 23 is a splice variant designated a 1A . 2 . DNA encoding or 1A 
subunits also include DNA encoding subunits that can be 
isolated using all or a portion of the DNA having SEQ ID NO. 
21, 22 or 23 or DNA obtained from the phage lysate of an E. 
coli host containing DNA encoding an a 1A subunit that has been 
deposited in the American Type Culture Collection, 12301 
Parklavm Drive, Rockville. Maryland 20852 U.S.A. under 
Accession No. 75293 in accord with the Budapest Treaty. The 
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DNA in such phage includes a DNA fragment having the sequence 
set forth in SEQ ID No. 21. This fragment selectively 
hybridizes under conditions of high stringency to DNA encoding 
a 1A but not to DNA encoding a 1B and, thus, can be used to 
isolate DNA that encodes ck 1a subunits . 

DNA encoding ct ie subunits of a human calcium channel is 
also provided. This DNA includes DNA that encodes - an or 1E 
splice variant designated ot^^ encoded by the DNA set forth in 
SEQ ID No. 24, and a variant designated a 1E . 3 encoded by SEQ ID 
No. 25. This DNA also includes other splice variants thereof 
that encodes sequences of amino acids encoded by all or a 
portion of the sequences of nucleotides set forth in SEQ ID 
Nos. 24 and 25 and DNA that hybridizes under conditions of 
high stringency to the DNA of SEQ ID. No. 24 or 25 and that 
encodes an a 1E splice variant. 

DNA encoding ot 2 subunits of a human calcium channel, and 
RNA encoding such subunits, made upon transcription of such a 
DNA are provided. DNA encoding splice variants of the o 2 
subunit, including tissue-specific splice variants, are also 
provided. In particular, DNA encoding the a 2a -a 2e subunit 
subtypes is provided. In particularly preferred embodiments, 
the DNA encoding the or 2 subunit that is produced by 
alternative processing of a primary transcript that includes 
DNA encoding the amino acids set forth in SEQ ID 11 and the 
DNA of SEQ ID No. 13 inserted between nucleotides 1624 and 
1625 of SEQ ID No. 11 is provided. The DNA and amino acid 
sequences of a 3a - a 2e are set forth in SEQ. ID Nos. 11 and 
29-32, respectively. 

Isolated and purified DNA fragments encoding human 
calcium channel )3 subunits, including DNA encoding /J 1# 0 2 , 0 3 
and /3< subunits, and splice variants of the /? subunits are 
provided. RNA encoding /3 subunits, made upon transcription of 
the DNA is also provided. 

DNA encoding a subunit that is produced by alternative 
processing of a primary transcript that includes DNA encoding 
the amino acids set forth in SEQ ID No. 9, but including the 
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DNA set forth in SEQ ID No. 12 inserted in place of 
nucleotides 615-781 of SEQ ID No. 9 is also provided. DNA 
encoding 0, subunits that are encoded by transcripts that have 
the sequence set forth in SEQ ID No. 9 including the DNA set 
forth in SEQ ID No. 12 inserted in place of nucleotides 615- 
781 of SEQ ID No. 9, but that lack one or more of the 
following sequences of nucleotides: nucleotides 14-34 of SEQ 
ID No. 12, nucleotides 13-34 of SEQ ID No. 12, nucleotides 35- 
55 of SEQ ID No 12, nucleotides 56-190 of SEQ ID No. 12 and 
nucleotides 191-271 of SEQ ID No. 12 are also provided. In 
particular, p i subunit splice variants (3^-0^ (see, SEQ ID 
Nos. 9, 10 and 33-35) described below, are provided. 

B 2 subunit splice variants P 2c -0 2e , that include all or a 
portion of SEQ ID Nos. 26, 29 and 30 are provided,- 0 3 subunit 
splice variants, including 0 3 subunit splice variants that 
have the sequences set forth in SEQ ID Nos 19 and 20, and DNA 
encoding the (3 4 subunit that includes DNA having the sequence 
set forth in SEQ ID No. 27 and the amino acid sequence set 
forth in SEQ ID No. 28 are provided. 

Also Escherichia coli (E. coli) host cells harboring 
plasmids containing DNA encoding 0 3 have been deposited in 
accord with the Budapest Treaty under Accession No. 69048 at 
the American Type Culture Collection. The deposited clone 
encompasses nucleotides 122-457 in SEQ ID No. 19 and 107-443 
in SEQ ID No. 20 . 

DNA encoding 0 subunits that are produced by alternative 
processing of a primary transcript encoding a /3 subunit, 
including a transcript that includes DNA encoding the amino 
acids set forth in SEQ ID No. 9 or including a primary 
transcript that encodes £ 3 as deposited under ATCC Accession 
No. 6904 8, but lacking and including alternative exons are 
provided or may be constructed from the DNA provided herein. 

DNA encoding y subunits of human calcium channels is also 
provided. RNA, encoding y subunits, made upon transcription 
of the DNA are also provided. in particular, DNA containing 
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the sequence of nucleotides set forth in SEQ ID No. 14 is 
provided. 

Full-length DNA clones and corresponding RNA transcripts, 
encoding o 1( including splice variants of a 1A , a 1D , a 1B , a ici and 
a 1E , a 2 and 0 subunits, including fi^-fixs. P20 Pid* Pat> P2-1 and 
0, of human calcium channels are provided. Also provided are 
DNA clones encoding a substantial portions of the certain o lc 
subtype subunits and y subunits of voltage-dependent human 
calcium channels for the preparation of full-length DNA clones 
encoding the corresponding full-length subunits. Full-length 
clones may be readily obtained using the disclosed DNA as a 
probe as described herein. 

The the subunit, o lc subunit, or 1E subunit and splice 

variants thereof, the 0 JD , 0 2C and 0 2E subunits and 0< subunits 
and nucleic acids encoding these subunits are of particular 
interest herein. 

Eukaryotic cells containing heterologous DNA encoding one 
or more calcium channel subunits, particularly human calcium 
channel subunits, or containing RNA transcripts of DNA clones 
encoding one or more of the subunits are provided. A single 
a t subunit can form a channel. The requisite combination of 
subunits for formation of active channels in selected cells, 
however, can be determined empirically using the methods 
herein. For example, if a selected a, subtype or variant does 
not form an active channel in a selected cell line, an 
additional subunit or subunits can be added until an active 
channel is formed. 

In preferred embodiments, the cells contain DNA or RNA 
encoding a human a, subunit, preferably at least an ot 1D , a„, a 1A 
or o 1E subunit. In more preferred embodiments, the cells 
contain DNA or RNA encoding additional heterologous subunits, 
including at least one 0, a 2 or y subunit. In such 
embodiments, eukaryotic cells stably or transiently 
transfected with any combination of one, two, three or four of 
the subunit -encoding DNA clones, such as DNA encoding any of 
a a , a a + /5, + 0 + a 3 . are provided. 



WO 95/04822 



PCT/US94/09230 



The eukaryotic cells provided herein contain heterologous 
DNA that encodes an ct x subunit or heterologous DNA that 
encodes an a, subunit and heterologous DNA that encodes a 0 
subunit. At least one subunit is selected a XA . lf a^, a 
a iE-i> QfiE-a/ P20 /3 2 d/ 02F» a ^3.1, P 3 . 2 subunit or a /3 4 subunit. Si 
preferred embodiments, the cells express such heterologous 
calcium channel subunits and include one or more of the 
subunits in membrane -spanning heterologous calcium channels. 
In more preferred embodiments, the eukaryotic cells express 
functional, heterologous calcium channels that are capable of 
gating the passage of calcium channel -selective ions and/or 
binding compounds that, at physiological concentrations, 
modulate the activity of the heterologous calcium channel . in 
certain embodiments, the heterologous calcium channels include 
at least one heterologous calcium channel subunit. in most 
preferred embodiments, the calcium channels that are expressed 
on the surface of the eukaryotic cells are composed 
substantially or entirely of subunits encoded by the 
heterologous DNA or RNA. m preferred embodiments, the 
heterologous calcium channels of such cells are 
distinguishable from any endogenous calcium channels of the 
host cell. Such cells provide a means to obtain homogeneous 
populations of calcium channels. Typically, the cells contain 
the selected calcium channel as the only heterologous ion 
channel expressed, by the cell . 

In certain embodiments the recombinant eukaryotic cells 
that contain the heterologous DNA encoding the calcium channel 
subunits are produced by transfection with DNA encoding one or 
more of the subunits or are injected with RNA transcripts of 
DNA encoding one or more of the calcium channel subunits. The 
DNA may be introduced as a linear DNA fragment or may be 
included in an expression vector for stable or transient 
expression of the subunit-encoding DNA. Vectors containing 
DNA encoding human calcium channel subunits are also provided. 
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The eukaryotic cells that express heterologous calcium 
channels may be used in assays for calcium channel function 
or, in the case of cells transformed with fewer subunit- 
encoding nucleic acids than necessary to constitute a 
functional recombinant human calcium channel, such cells may 
be used to assess the effects of additional subunits on 
calcium channel activity. The additional subunits can be 
provided by subsequently transfecting such a cell with one or 
more DNA clones or RNA transcripts encoding human calcium 
channel subunits . 

The recombinant eukaryotic cells that express membrane 
spanning heterologous calcium channels may be used in methods 
for identifying compounds that modulate calcium channel 
activity. In particular, the cells are used in assays that 
identify agonists and antagonists of calcium channel activity 
in humans and/or assessing the contribution of the various 
calcium channel subunits to the transport and regulation of 
transport of calcium ions. Because the cells constitute 
homogeneous populations of calcium channels, they provide a 
means to identify agonists or antagonists of calcium channel 
activity that are specific for each such population. 

The assays that use the eukaryotic cells for identifying 
compounds that modulate calcium channel activity are also 
provided. In practicing these assays the eukaryotic cell that 
expresses a heterologous calcium channel, containing at least 
on subunit encoded by the DNA provided herein, is in a 
solution containing a test compound and a calcium channel 
selective ion, the cell membrane is depolarized, and current 
flowing into the cell is detected. If the test compound is 
one that modulates calcium channel activity, the current that 
is detected is different from that produced by depolarizing 
the same or a substantially identical cell in the presence of 
the same calcium channel -selective ion but in the absence of 
the compound. In preferred embodiments, prior to the 
depolarization step, the cell is maintained at a holding 
potential which substantially inactivates calcium channels 
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which are endogenous to the cell. Also in preferred 

embodiments, the cells are mammalian cells, most preferably 
HEK cells, or amphibian oocytes. 

Nucleic acid probes, typically labeled for detection, 
containing at least about 14, preferably 16, or, if desired, 
20 or 3 0 or more, contiguous nucleotides of a iD , a lc , a 1B , a 1A 
and a 1E , a 2 , (j, including fi„ p a ,p 3 and B t splice variants and 
y subunit- encoding DNA are provided. Methods using the probes 
for the isolation and cloning of calcium channel subunit - 
encoding DNA, including splice variants within tissues and 
inter- tissue variants are also provided. 

Purified human calcium channel subunits and purified 
human calcium channels are provided. The subunits and 
channels can be isolated from a eukaryotic cell transfected 
with DNA that encodes the subunit. 

In another embodiment, immunoglobulins or antibodies 
obtained from the serum of an animal immunized with a 
substantially pure preparation of a human calcium channel, 
human calcium channel subunit or epitope -containing fragment 
of a human calcium subunit are provided. Monoclonal 
antibodies produced using a human calcium channel, human 
calcium channel subunit or epitope -containing fragment thereof 
as an immunogen are also provided. E. coli fusion proteins 
including a fragment of a human calcium channel subunit may 
also be used as immunogen. Such fusion proteins may contain 
a bacterial protein or portion thereof, such as the E. coli 
TrpE protein, fused to a calcium channel subunit peptide. The 
immunoglobulins that are produced using the calcium channel 
subunits or purified calcium channels as immunogens have, 
among other properties, the ability to specifically and 
preferentially bind to and/or cause the immunoprecipitation of 
a human calcium channel or a subunit thereof which may be 
present in a biological sample or a solution derived from such 
a biological sample. Such antibodies may also be used to 
selectively isolate cells that express calcium channels that 
contain the subunit for which the antibodies are specific. 
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Methods for modulating the activity of ion channels by 
contacting the calcium channels with an effective amount of 
the above-described antibodies are also provided. 

A diagnostic method for determining the presence of 
Lambert Eaton Syndrome (LES) in a human based on immunological 
reactivity of LES immunoglobulin G (IgG) with a human calcium 
channel subunit or a eukaryotic cell which expresses a 
recombinant human calcium channel or a subunit thereof is also 
provided. In particular, an immunoassay method for diagnosing 
Lambert -Eaton Syndrome in a -person by combining serum or an 
IgG fraction from the person (test serum) with calcium channel 
proteins, including the a and £ subunits, and ascertaining 
whether antibodies in the test serum react with one or more of 
the subunits, or a recombinant cell which expresses one or 
more of the subunits to a greater extent than antibodies in 
control serum, obtained from a person or group of persons 
known to be free of the Syndrome, is provided. Any 
immunoassay procedure known in the art for detecting 
antibodies against a given antigen in serum can be employed in 
the method. 

DETAILED DESCRIPTION OF THE INVENTION 
Definitions : 

Unless defined otherwise, all technical and scientific 
terms used herein have the same meaning as is commonly 
understood by one of skill in the art to which this invention 
belongs. All patents and publications referred to herein are 
incorporated by reference herein. 

Reference to each of the calcium channel subunits 
includes the subunits that are specifically disclosed herein 
and human calcium channel subunits encoded by DNA that can be 
isolated by using the DNA disclosed as probes and screening an 
appropriate human cDNA or genomic library under at least low 
stringency. Such DNA also includes DNA that encodes proteins 
that have about 4 0% homology to any of the subunits proteins 
described herein or DNA that hybridizes under conditions of at 
least low stringency to the DNA provided herein and the 
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protein encoded by such DNA exhibits additional identifying 
characteristics, such as function or molecular weight. 

It is understood that subunits that are encoded by 
transcripts that represent splice variants of the disclosed 
subunits or other such subunits may exhibit less than 40% 
overall homology to any single subunit, but will include 
regions of such homology to one or more such subunits. It is 
also understood that 40% homology refers to proteins that 
share approximately 4 0% of their amino acids in common or that 
share somewhat less, but include conservative amino acid 
substitutions, whereby the activity of the protein is not 
substantially altered. 

As used herein, the ot 1 subunits types, encoded by 
different genes, are designated as type ot^, a 1B , a lc , «„ and 
a 1E . These types have also been referred to as VDCC IV for a 1B , 
VDCC II for a lc and VDCC III for a 1D . Subunit subtypes, which 
are splice variants, are referred to, for example as a 1B . lt a„. 
2' Oic-i etc. 

Thus, as used herein, DNA encoding the a x subunit refers 
to DNA that hybridizes to the DNA provided herein under 
conditions of at least low stringency or encodes a subunit 
that has at least about 4 0% homology to protein encoded by DNA 
disclosed herein that encodes an subunit of a human 

calcium. An a 1 subunit may be identified by its ability to 
form a calcium channel. Typically, a x subunits have 

molecular masses greater than at least about 120 kD. Also, 
hydropathy plots of deduced a, subunit amino acid sequences 
indicate that the a, subunits contain four internal repeats, 
each containing six putative transmembrane domains. 

The activity of a calcium channel may be assessed in 
vitro by methods known to those of skill in the art, including 
the electrophysiological and other methods described herein. 
Typically, a, subunits include regions to which one or more 
modulators of calcium channel activity, such as a 1,4-DHP or 
u-CgTx, interact directly or indirectly. Types of subunits 
may be distinguished by any method known to those of skill in 
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the art, including on the basis of binding specificity. For 
example, it has been found herein that a 1B subunits participate 
in the formation channels that have previously been referred 
to as N-type channels, oi 1D subunits participate in the 
formation of channels that had previously been referred to as 
L-type channels, and a 1A subunits appear to participate in the 
formation of channels that exhibit characteristics typical of 
channels that had previously been designated P-type channels. 
Thus, for example, the activity of channels that contain the 
c* 1B subunit are insensitive to 1,4-DHPs; whereas the activity 
of channels that contain the a 1D subunit are modulated or 
altered by a 1,4-DHP. It is presently preferable to refer to 
calcium channels based on pharmacological characteristics and 
current kinetics and to avoid historical designations. Types 
and subtypes of a 1 subunits may be characterized on the basis 
of the effects of such modulators on the subunit or a channel 
containing the subunit as well as differences in currents and 
current kinetics produced by calcium channels containing the 
subunit . 

As used herein, an a 2 subunit is encoded by DNA that 
hybridizes to the DNA provided herein under conditions of low 
stringency or encodes a protein that has at least about 4 0% 
homology with that disclosed herein. Such DNA encodes a 
protein that typically has a molecular mass greater than about 
12 0 kD, but does not form a calcium channel in the absence of 
an a 1 subunit, and may alter the activity of a calcium channel 
that contains an subunit. Subtypes of the a 2 subunit that 
arise as splice variants are designated by lower case letter, 
such as o 2a , . . . a Je . In addition, the or 2 subunit and the 
large fragment produced when the protein is subjected to 
reducing conditions appear to be glycosylated with at least 
N-linked sugars and do not specifically bind to the 1,4-DHPs 
and phenyl alkyl amines that specifically bind to the a 1 
subunit. The smaller fragment, the C-terminal fragment, is 
referred to as the 6 subunit and includes amino acids from 
about 946 (SEQ ID No. 11) through about the C- terminus . This 
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fragment may dissociate from the remaining portion of or 2 when 
the a 3 subunit is exposed to reducing conditions. 

As used herein, a (3 subunit is encoded by DNA that 
hybridizes to the DNA provided herein under conditions of low 
stringency or encodes a protein that has at least about 4 0% 
homology with that disclosed herein and is a protein that 
typically has a molecular mass lower than the ex subunits and 
on the order of about 50-80 kD, does not form a detectable 
calcium channel in the absence of an subunit, but may alter 
the activity of a calcium channel that contains an a x subunit 
or that contains an a 2 and ot 2 subunit. 

Types of the 0 subunit that are encoded by different 
genes are designated with subscripts, such as jS lf /S 2 , /? 3 and 0 4 . 
Subtypes of /? subunits that arise as splice variants of a 
particular type are designated with a numerical subscript 
referring to the type and to the variant. Such subtypes 
include, but are not limited to the p 1 splice variants, 
including ft.i-^,., and 0 2 variants, including /3 2C -/S 2E . 

As used herein, a y subunit is a subunit encoded by DNA 
disclosed herein as encoding the y subunit and may be isolated 
and identified using the DNA disclosed herein as a probe by 
hybridization or other such method known to those of skill in 
the art, whereby full-length clones encoding a y subunit may 
be isolated or constructed. A y subunit will be encoded by 
DNA that hybridizes to the DNA provided herein under 
conditions of low stringency or exhibits sufficient sequence 
homology to encode a protein that has at least about 4 0% 
homology with the y subunit described herein. 

Thus, one of skill in the art, in light of the disclosure 
herein, can identify DNA encoding a lf a 3 , /3, <5 and y calcium 
channel subunits, including types encoded by different genes 
and subtypes that represent splice variants. For example, 
DNA probes based on the DNA disclosed herein may be used to 
screen an appropriate library, including a genomic or cDNA 
library, for hybridization to the probe and obtain DNA in one 
or more clones that includes an open reading fragment that 
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encodes an entire protein. Subsequent to screening an 
appropriate library with the DNA disclosed herein, the 
isolated DNA can be examined for the presence of an open 
reading frame from which the sequence of the encoded protein 
may be deduced. Determination of the molecular weight and 
comparison with the sequences herein should reveal the 
identity of the subunit as an at lt a 2 etc. subunit . Functional 
assays may, if necessary, be used to determine whether the 
subunit is an o^, a 2 subunit or /3 subunit. 

For example, DNA encoding an a 1A subunit may be isolated 
by screening an appropriate library with DNA, encoding all or 
a portion of the human a 1A subunit. Such DNA includes the DNA 
in the phage deposited under ATCC Accession No. 75293 that 
encodes a portion of an a 1 subunit. DNA encoding an ar 1A 
subunit may obtained from an appropriate library by screening 
with an oligonucleotide having all or a portion of the 
sequence set forth in SEQ ID No. 21, 22 and/or 23 or with the 
DNA in the deposited phage. Alternatively, such DNA may have 
a sequence that encodes an a 1A subunit that is encoded by SEQ 
ID NO. 22 or 23 . 

Similarly, DNA encoding /S 3 may be isolated by screening 
a human cDNA library with DNA probes prepared from the plasmid 
01.42 deposited under ATCC Accession No. 69048 or obtained 
from an appropriate library using probes having sequences 
prepared according to the sequences set forth in SEQ ID Nos . 
19 and/or 20. Also, DNA encoding /S, may be isolated by 
screening a human cDNA library with DNA # probes prepared 
according to DNA set forth in SEQ ID No. 27, which sets forth 
the DNA sequence of a clone encoding a /3« subunit . The amino 
acid sequence is set forth in SEQ ID No. 28. Any method known 
to those of skill in the art for isolation and identification 
of DNA and preparation of full-length genomic or cDNA clones, 
including methods exemplified herein, may be used. DNA 
encoding 

The subunit encoded by isolated DNA may be identified by 
comparison with the DNA and amino acid sequences of the 
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subunits provided herein. Splice variants share extensive 
regions of homology, but include non-homologous regions, 
subunits encoded by different genes share a uniform 
distribution of non- homologous sequences . 

As used herein, a splice variant refers to a variant 
produced by differential processing of a primary transcript of 
genomic DNA that results in more than one type of mRNA. 
Splice variants may occur within a single tissue type or among 
tissues (tissue-specific variants) . Thus, cDNA clones that 
encode calcium channel subunit subtypes that have regions of 
identical amino acids and regions of different amino acid 
sequences are referred to herein as "splice variants". 

As used herein, a "calcium channel -selective ion" is an 
ion that is capable of flowing through, or being blocked from 
flowing through, a calcium channel which spans a cellular 
membrane under conditions which would substantially similarly 
permit or block the flow of Ca 2 *. Ba 2 * is an example of an ion 
which is a calcium channel-selective ion. 

As used herein, a compound that modulates calcium channel 
activity is one that affects the ability of the calcium 
channel to pass calcium channel -selective ions or affects 
other detectable calcium channel features, such as current 
kinetics. Such compounds include calcium channel antagonists 
and agonists and compounds that exert their effect on the 
activity of the calcium channel directly or indirectly. 

As used herein, a "substantially pure" subunit or protein 
is a subunit or protein that is sufficiently free of other 
polypeptide contaminants to appear homogeneous by SDS-PAGE or 
to be unambiguously sequenced. 

As used herein, selectively hybridize means that a DNA 
fragment hybridizes to a second fragment with sufficient 
specificity to permit the second fragment to be identified or 
isolated from among a plurality of fragments. In general, 
selective hybridization occurs at conditions of high 
stringency. 
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As used herein, heterologous or foreign DNA and RNA are 
used interchangeably and refer to DNA or RNA that does not 
occur naturally as part of the genome in which it is present 
or which is found in a location or locations in the genome 
that differ from that in which it occurs in nature. It is DNA 
or RNA that is not endogenous to the cell and has been 
artificially introduced into the cell. Examples of 

heterologous DNA include, but are not limited to, DNA that 
encodes a calcium channel subunit and DNA that encodes RNA or 
proteins that mediate or alter expression of endogenous DNA by 
affecting transcription, translation, or other regulatable 
biochemical processes. The cell that expresses the 

heterologous DNA, such as DNA encoding a calcium channel 
subunit, may contain DNA encoding the same or different 
calcium channel subunits. The heterologous DNA need not be 
expressed and may be introduced in a manner such that it is 
integrated into the host cell genome or is maintained 
episomally . 

As used herein, operative linkage of heterologous DNA to 
regulatory and effector sequences of nucleotides, such as 
promoters, enhancers, transcriptional and translational stop 
sites, and other signal sequences, refers to the functional 
relationship between such DNA and such sequences of 
nucleotides. For example, operative linkage of heterologous 
DNA to a promoter refers to the physical and functional 
relationship between the DNA and the promoter such that the 
transcription of such DNA is initiated from the promoter by an 
RNA polymerase that specifically recognizes, binds to and 
transcribes the DNA in reading frame. 

As used herein, isolated, substantially pure DNA refers 
to DNA fragments purified according to standard techniques 
employed by those skilled in the art [see, e.g., Maniatis et 
al. (1982) Molecular Cloning: A Laboratory Manual , Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, NY]. 

As used herein, expression refers to the process by which 
nucleic acid is transcribed into mRNA and translated into 
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peptides, polypeptides, or proteins. If the nucleic acid is 
derived from genomic DNA, expression may, if an appropriate 
eukaryotic host cell or organism is selected, include splicing 
of the mRNA. 

As used herein, vector or plasmid refers to discrete 
elements that are used to introduce heterologous DNA into 
cells for either expression of the heterologous DNA or for 
replication of the cloned heterologous DNA. Selection and use 
of such vectors and plasmids are well within the level of 
skill of the art. 

As used herein, expression vector includes vectors 
capable of expressing DNA fragments that are in operative 
linkage with regulatory sequences, such as promoter regions, 
that are capable of effecting expression of such DNA 
fragments. Thus, an expression vector refers to a recombinant 
DNA or RNA construct, such as a plasmid, a phage, recombinant 
virus or other vector that, upon introduction into an 
appropriate host cell, results in expression of the cloned 
DNA. Appropriate expression vectors are well known to those 
of skill in the art and include those that are replicable in 
eukaryotic cells and/or prokaryotic cells and those that 
remain episomal or may integrate into the host cell genome. 

As used herein, a promoter region refers to the portion 
of DNA of a gene that controls transcription of DNA to which 
it is operatively linked. The promoter region includes 

specific sequences of DNA that are sufficient for RNA X 
polymerase recognition, binding and transcription initiation. 
This portion of the promoter region is referred to as the 
promoter. In addition, the promoter region includes sequences 
that modulate this recognition, binding and transcription 
initiation activity of the RNA polymerase. These sequences 
may be cis acting or may be responsive to trans acting 
factors. Promoters, depending upon the nature of the 
regulation, may be constitutive or regulated. 

As used herein, a recombinant eukaryotic cell is a 
eukaryotic cell that contains heterologous DNA or RNA. 
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As used herein, a recombinant or heterologous calcium 
channel refers to a calcium channel that contains one or more 
subunits that are encoded by heterologous DNA that has been 
introduced into and expressed in a eukaryotic cells that 
expresses the recombinant calcium channel . A recombinant 
calcium channel may also include subunits that are produced by 
DNA endogenous to the cell. In certain embodiments, the 
recombinant or heterologous calcium channel may contain only 
subunits that are encoded by heterologous DNA. 

As used herein, "functional" with respect to a 
recombinant or heterologous calcium channel means that the 
channel is able to provide for and regulate entry of calcium 
channel -selective ions, including, but not limited to, Ca 2 * or 
Ba 2 \ in response to a stimulus and/or bind ligands with 
affinity for the channel. Preferably such calcium channel 
activity is distinguishable, such as electrophysiological, 
pharmacological and other means known to those of skill in the 
art, from any endogenous calcium channel activity that in the 
host cell . 

As used herein, a peptide having an amino acid sequence 
substantially as set forth in a particular SEQ ID No. 
includes peptides that have the same function but may include 
minor variations in sequence, such as conservative amino acid 
changes or minor deletions or insertions that do not alter the 
activity of the peptide. The activity of a calcium channel 
receptor subunit peptide refers to its ability to form 
functional calcium channels with other such subunits. 

As used herein, a physiological concentration of a 
compound is that which is necessary and sufficient for a 
biological process to occur. For example, a physiological 
concentration of a calcium channel -selective ion is a 
concentration of the calcium channel - selective ion necessary 
and sufficient to provide an inward current when the channels 
open. 

As used herein, activity of a calcium channel refers to 
the movement of a calcium channel -selective ion through a 
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calcium channel. Such activity may be measured by any method 
known to those of skill in the art, including, but not limited 
to, measurement of the amount of current which flows through 
the recombinant channel in response to a stimulus. 

As used herein, a "functional assay" refers to an assay 
that identifies functional calcium channels. a functional 
assay, thus, is an assay to assess function. 

As understood by those skilled in the art, assay methods 
for identifying compounds, such as antagonists and agonists, 
that modulate calcium channel activity, generally requires 
comparison to a control. One type of a "control" cell or 
"control" culture is a cell or culture that is treated 
substantially the same as the cell or culture exposed to the 
test compound except that the control culture is not exposed 
to the test compound. Another type of a "control" cell or 
"control" culture may be a cell or a culture of cells which 
are identical to the transfected "cells except the cells 
employed for the control culture do not express functional 
calcium channels. in this situation, the response of test 
cell to the test compound is compared to the response (or lack 
of response) of the calcium channel -negative cell to the test 
compound, when cells or cultures of each type of cell are 
exposed to substantially the same reaction conditions in the 
presence of the compound being assayed. For example, in 
methods that use patch clamp electrophysiological procedures, 
the same cell can be tested in the presence and absence of the 
test compound, by changing the external solution bathing the 
cell as known in the art. 

It is also understood that each of the subunits disclosed 
herein may be modified by making conservative amino acid 
substitutions and the resulting modified subunits are 
contemplated herein. Suitable conservative substitutions of 
amino acids are known to those of skill in this art and may be 
made generally without altering the biological activity of the 
resulting molecule. Those of skill in this art recognize 
that, in general, single amino acid substitutions in non- 
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essential regions of a polypeptide do not substantially alter 
biological activity (see, e.g. . Watson et al . Molecular 
Biology of the Gene, 4th Edition, 1987, The Be jacmin/Cummings 
Pub. co., p. 224) . Such substitutions are preferably, although 
not exclusively, made in accordance with those set forth in 



follows : 






TABLE 1 


Original residue 


Conservative substitution 


Ala (A) 


Gly; Ser 


Arg (R) 


Lys 


Asn (N) 


Gin; His 


Cys (C) 


Ser 


Gin <Q) 


Asn 


Glu (E) 


Asp 


Gly (G) 


Ala; Pro 


His <H) 


Asn; Gin 


He (1) 


Leu; Val 


Leu (L) 


lie; Val 


Lys (K) 


Arg; Gin; Glu 


Met (M) 


Leu; Tyr; He 


Phe <F) 


Met; Leu; Tyr 


Ser (S) 


Thr 


Thr (T) 


Ser 


Trp (W) 


Tyr 


Tyr (Y) 


Trp; Phe 


Val (V) 


He; Leu 



Other substitutions are also permissible and may be 
determined empirically or in accord with known conservative 
substitutions. Any such modification of the polypeptide may 
be effected by any means known to those of skill in this art. 
Mutation may be effected by any method known to those of skill 
in the art, including site-specific or site-directed 
mutagenesis of DNA encoding the protein and the use of DNA 
amplification methods using primers to introduce and amplify 
alterations in the DNA template. 

Identification and isolation of DNA encoding human calcium 
channel subunits 

Methods for identifying and isolating DNA encoding o^, a 2 , 
j8 and y subunits of human calcium channels are provided. 

Identification and isolation of such DNA may be 
accomplished by hybridizing, under appropriate conditions, at 
least low stringency whereby DNA that encodes the desired 
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subunit is isolated, restriction enzyme -digested human DNA 
with a labeled probe having at least 14, preferably 16 or more 
nucleotides and derived from any contiguous portion of DNA 
having a sequence of nucleotides set forth herein by sequence 
identification number. Once a hybridizing fragment is 
identified in the hybridization reaction, it can be cloned 
employing standard cloning techniques known to those of skill 
in the art. Full-length clones may be identified by the 
presence of a complete open reading frame and the identity of 
the encoded protein verified by sequence comparison with the 
subunits provided herein and by functional assays to assess 
calcium channel- forming ability or other function. This 
method can be used to identify genomic DNA encoding the 
subunit or cDNA encoding splice variants of human calcium 
channel subunits generated by alternative splicing of the 
primary transcript of genomic subunit DNA. For instance, DNA, 
cDNA or genomic DNA, encoding a calcium channel subunit may be 
identified by hybridization to a DNA probe and characterized 
by methods known to those of skill in the art, such as 
restriction mapping and DNA sequencing, and compared to the 
DNA provided herein in order to identify heterogeneity or 
divergence in the sequences of the DNA. Such sequence 
differences may indicate that the transcripts from which the 
cDNA was produced result from alternative splicing of a 
primary transcript, if the non- homologous and homologous 
regions are clustered, or from a different gene if the non- 
homologous regions are distributed throughout the cloned DNA. 

Any suitable method for isolating genes using the DNA 
provided herein may be used. For example, oligonucleotides 
corresponding to regions of sequence differences have been 
used to isolate, by hybridization, DNA encoding the full- 
length splice variant and can be used to isolate genomic 
clones. A probe, based on a nucleotide sequence disclosed 
herein, which encodes at least a portion of a subunit of a 
human calcium channel, such as a tissue-specific exon, may be 
used as a probe to clone related DNA, to clone a full-length 
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CDNA clone or genomic clone encoding the human calcium channel 
subunit . 

Labeled, including, but not limited to, radioactively or 
enzymatically labeled, RNA or single-stranded DNA of at least 
14 substantially contiguous bases, preferably 16 or more, 
generally at least 3 0 contiguous bases of a nucleic acid which 
encodes at least a portion of a human calcium channel subunit, 
the sequence of which nucleic acid corresponds to a segment of 
a nucleic acid sequence disclosed herein by reference to a SEQ 
ID No. are provided. Such nucleic acid segments may be used 
as probes in the methods provided herein for cloning DNA 
encoding calcium channel subunits. See, generally, Sambrook 
et al. (1989) Molecular Cloning: A Laboratory Manual, 2nd 
Edition, Cold Spring Harbor Laboratory Press. 

In addition, nucleic acid amplification techniques, which 
are well known in the art, can be used to locate splice 
variants of calcium channel subunits by employing 
oligonucleotides based on DNA sequences surrounding the 
divergent sequence primers for amplifying human RNA or genomic 
DNA. Size and sequence determinations of the amplification 
products can reveal splice variants. Furthermore, isolation 
of human genomic DNA sequences by hybridization can yield DNA 
containing multiple exons, separated by introns, that 
correspond to different splice variants of transcripts 
encoding human calcium channel subunits. 

DNA encoding types and subtypes of each of the a lt a 2 , (3 
and y subunit of volt age -dependent human calcium channels has 
been cloned herein by nucleic acid amplication of cDNA from 
selected tissues or by screening human cDNA libraries prepared 
from isolated poly A+ mRNA from cell lines or tissue of human 
origin having such calcium channels . Among the sources of 
such cells or tissue for obtaining mRNA are human brain tissue 
or a human cell line of neural origin, such as a neuroblastoma 
cell line, human skeletal muscle or smooth muscle cells, and 
the like. Methods of preparing cDNA libraries are well known 
in the. art [see generally Ausubel et al . (1987) Current 
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Protocols in Molecular Biology, Wiley-Interscience, New York; 
and Davis et al . (1986) Basic Methods in Molecular Biology, 
Elsevier Science Publishing Co., New York]. 

Preferred regions from which to construct probes include 
5 ' and/ or 3 ' coding sequences , sequences predicted to encode 
transmembrane domains, sequences predicted to encode 
cytoplasmic loops, signal sequences, ligand-binding sites, and 
other functionally significant sequences (see Table, below) . 
Either the full-length subunit -encoding DNA or fragments 
thereof can be used as probes, preferably labeled with 
suitable label means for ready detection. When fragments are 
used as probes, preferably the DNA sequences will be typically 
from the carboxyl -end-encoding portion of the DNA, and most 
preferably will include predicted transmembrane domain- 
encoding portions based on hydropathy analysis of the deduced 
amino acid sequence [see, e.g. . Kyte and Doolittle [(1982) J. 
Mol. Biol. 167 :1051 . 

Riboprobes that specific for human calcium channel 
subunit types or subtypes have been prepared. These probes 
are useful for identifying expression of particular subunits 
in selected tissues and cells. The regions from which the 
probes were prepared were identified by comparing the DNA and 
amino acid sequences of all known a or & subunit subtypes. 
Regions of least homology, preferably human-derived sequences, 
and generally about 250 to about 600 nucleotides were 
selected. Numerous riboprobes for a and 0 subunits have been 
prepared; some of these are listed in the following Table. 
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TABLE 2 
SUMMARY OF RKA PROBES 



SUBUNIT 
SPECIFICITY 


NUCLEOTIDE 
POSITION 


PROBE NAME 


PROBE TYPE 


ORIENTA- 
TION 


alA generic 


3357-3840 


pGEM7ZalA* 


riboprobe 


n/a 


761-790 


SE700 


oligo 


antisense 


3440-3464 


SE718 


oligo 


antisense 










alB generic 


3091-3463 


pGEM7ZolB cyt 


ribo robe 


n/a 


6635-6858 


pGEM7ZttlB h 


P 




alB-1 
specific 


6490-6676 


alB-1/187 


riboprobe 


n/ a 


olE generic 


3114-3462 


pGEM7ZolE 


riboprobe 


n/a 


a2b 


1321-1603 


pCRIIo2b 


riboprobe 


n/a 


/8 generic (?) 


212-236 


SE300 


oligo 


antisense 


01 generic 


1267-1291 


SE301 


oligo 


antisense 


01-2 
specific 


1333-1362 


SE17 


oligo 


antisense 




1682-1706 


SE23 


oligo 


sense 


2742-2766 


SE43 


oligo 


antisense 


27-56 


SE208 


oligo 


antisense 


34 0-364 


SE274 


oligo 


antisense 


340-364 


SE275 


oligo 


sense 


(S3 specific 


1309-1509 




riboprobe 


n/a 


04 specific 


1228-1560 




riboprobe 


n/a 


* The pGEM ser 


les are available from Promega, Madison WI; see also, 



U.S. Patent No. 4,766,072. 

The above -noted nucleotide regions are also useful in 
selecting regions of the protein for preparation of subunit- 
specific antibodies, discussed below. 

The DNA clones and fragments thereof provided herein thus 
can be used to isolate genomic clones encoding each subunit 
and to isolate any splice variants by hybridization screening 
of libraries prepared from different human tissues. Nucleic 
acid amplification techniques, which are well known in the 
art, can also be used to locate DNA encoding splice variants 
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of human calcium channel subunits. This is accomplished by 
employing oligonucleotides based on DNA sequences surrounding 
divergent sequence (s) as primers for amplifying human RNA or 
genomic DNA. Size and sequence determinations of the 
amplification products can reveal the existence of splice 
variants. Furthermore, isolation of human genomic DNA 
sequences by hybridization can yield DNA containing multiple 
exons, separated by introns, that correspond to different 
splice variants of transcripts encoding human calcium channel 
subunits . 

Once DNA encoding a calcium channel subunit is isolated, 
ribonuclease (RNase) protection assays can be employed to 
determine which tissues express mRNA encoding a particular 
calcium channel subunit or variant. These assays provide a 
sensitive means for detecting and quantitating an RNA species 
in a complex mixture of total cellular RNA. The subunit DNA 
is labeled and hybridized with cellular RNA. If complementary 
mRNA is present in the cellular RNA, a DNA -RNA hybrid results . 
The RNA sample is then treated with RNase, which degrades 
single -stranded RNA. Any RNA -DNA hybrids are protected from 
RNase degradation and can be visualized by gel electrophoresis 
and autoradiography. In situ hybridization techniques can 
also be used to determine which tissues express mRNA encoding 
a particular calcium channel subunit. The labeled subunit 
DNAs are hybridized to different tissue slices to visualize 
subunit mRNA expression. 

With respect to each of the respective subunits {a lt a 2 , 
(3 or 7) of human calcium channels, once the DNA encoding the 
channel subunit was identified by a nucleic acid screening 
method, the isolated clone was used for further screening to 
identify overlapping clones. Some of the cloned DNA fragments 
can and have been subcloned into an appropriate vector such as 
PIBI24/25 (IBI, New Haven, CT) , M13mpl8/19, pGEM4 , pGEM3 , 
pGEM7Z, pSP72 and other such vectors known to those of skill 
in this art, and characterized by DNA sequencing and 
restriction enzyme mapping. A sequential series of 
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overlapping clones may thus be generated for each of the 
subunits until a full-length clone can be prepared by methods, 
known to those of skill in the art, that include 
identification of translation initiation (start) and 
translation termination (stop) codons . For expression of the 
cloned DNA, the 5' noncoding region and other transcriptional 
and translational control regions of such a clone may be 
replaced with an efficient ribosome binding site and other 
regulatory regions as known in the art. Other modifications 
of the 5' end, known to those of skill in the art, that may be 
required to optimize translation and/or transcription 
efficiency may also be effected, if deemed necessary. 

Examples II-VIIII, below, describe in detail the cloning 
of each of the various subunits of a human calcium channel as 
well as subtypes and splice variants, including tissue- 
specific variants thereof. In the few instances in which 
partial sequences of a subunit are disclosed, it is well 
within the skill of the art, in view of the teaching herein, 
to obtain the corresponding full-length clones and sequence 
thereof encoding the subunit, subtype or splice variant 
thereof using the methods described above and exemplified 
below. 

Identification and isolation of DNA encoding 
subunits 

A number of voltage -dependent calcium channel subunit 
genes, which are expressed in the human CNS and in other 
tissues, have been identified and have been designated as a 1A , 
a 1B (or VDCC IV), a ic (or VDCC II), ar 1D (or VDCC III) and a 1E . 
DNA, isolated from a human neural cDNA library, that encodes 
each of the subunit types has been isolated. DNA encoding 
subtypes of each of the types, which arise as splice variants 
are also provided. Subtypes are herein designated, for 
example, as o a . 1( a 1B _ 2 . 

The subunits types A B, C, D and E of voltage- 

dependent calcium channels, and subtypes thereof, differ with 
respect to sensitivity to known classes of calcium channel 
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agonists and antagonists, such as DHPs, phenylalkylamines , 
omega conotoxin (u-CgTx) , the funnel web spider toxin G>-Aga- 
IV, and pyrazonoylguanidines . These subunit types also appear 
to differ in the holding potential and in the kinetics of 
currents produced upon depolarization of cell membranes 
containing calcium channels that include different types of 
subunit s . 

DNA that encodes an a a subunit that binds to at least one 
compound selected from among dihydropyridines , 
phenylalkylamines, w-CgTx, components of funnel web spider 
toxin, and pyrazonoylguanidines is provided. For example, the 
a 1B subunit provided herein appears to specifically interact 
with u-CgTx in N-type channels, and the a 1D subunit provided 
herein specifically interacts with DHPs in L-type channels . 
Identification and isolation of DNA 
encoding the a u human calcium channel 
subunit 

The of 1D subunit cDNA has been isolated using fragments, of 
the rabbit skeletal muscle calcium channel subunit cDNA as 
a probe to screen a cDNA library of a human neuroblastoma cell 
line, IMR32, to obtain clone al.36. This clone was used as a 
probe to screen additional IMR32 cell cDNA libraries to obtain 
overlapping clones, which were then employed for screening 
until a sufficient series of clones to span the length of the 
nucleotide sequence encoding the human a 1D subunit was 
obtained. Full-length clones encoding a 1D were constructed by 
ligating portions of partial a 1D clones as described in Example 
II. SEQ ID No. 1 shows the 7,635 nucleotide sequence of the 
cDNA encoding the ot 1D subunit. There is a 6,483 nucleotide 
sequence reading frame which encodes a sequence of 2,161 amino 
acids (as set . forth in SEQ ID No. 1) . 

SEQ ID No. 2 provides the sequence of an alternative exon 
encoding the IS6 transmembrane domain [see Tanabe, T., et al . 
(1987) Nature 328:313-318 for a description of transmembrane 
domain terminology] of the a 1D subunit. 

SEQ ID No. 1 also shows the 2,161 amino acid sequence 
deduced from the human neuronal calcium channel a 1D subunit 
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DNA. Based on the amino acid sequence, the a 1D protein has a 
calculated Mr of 245,163. The a 1D subunit of the calcium 
channel contains four putative internal repeated sequence 
regions . Four internally repeated regions represent 24 
putative transmembrane segments, and the amino- and 
carboxyl- termini extend intracellularly . 

The a 1D subunit has been shown to mediate DHP-sensitive , 
high-voltage-activated, long- lasting calcium channel activity. 
This calcium channel activity was detected when oocytes were 
co- injected with RNA transcripts encoding an a 1D and or a iv , 
ot 2b and Q x _ 2 subunits. This activity was distinguished from Ba 2+ 
currents detected when oocytes were injected with RNA 
transcripts encoding the ± a 2b subunits. These currents 

pharmacologically and biophysically resembled Ca 2 * currents 
reported for uninjected oocytes. 

Identification and isolation of DNA 
encoding the or u human calcium channel 
subunit 

Biological material containing DNA encoding a portion of 
the a 1A subunit had been deposited in the American Type Culture 
Collection, 12301 Parklawn Drive, Rockville, Maryland 20852 
U.S.A. under the terms of the Budapest Treaty on the 
International Recognition of Deposits of Microorganisms for 
Purposes of Patent Procedure and the Regulations promulgated 
under this Treaty. Samples of the deposited material are and 
will be available to industrial property offices and other 
persons legally entitled to receive them under the terms of 
the Treaty and Regulations and otherwise in compliance with 
the patent laws and regulations of the United States of 
America and all other nations or international organizations 
in which this application, or an application claiming priority 
of this application, is filed or in which any patent granted 
on any such application is granted. 

A portion of an a 1R subunit is encoded by an approximately 
3 kb insert in XgtlO phage designated al.254 in E. coli host 
strain NM514 . A phage lysate of this material has been 
deposited as at the American Type Culture Collection under 
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ATCC Accession No. 75293, as described above. DNA encoding a 1A 
may also be identified by screening with a probe prepared from 
DNA that has SEQ ID No. 21: 

5 ' CTCAGTACCATCTCTGATACCAGCCCCA 3 ' . 

o- 1A splice variants have been obtained. The sequences of 
two cv 1A splice variants, a la .i and a la _ 2 are set forth in SEQ. ID 
Nos . 22 and 23. Other splice variants may be obtained by 
screening a human library as described above or using all or 
a portion of the sequences set forth in SEQ ID Nos. 22 and 23. 

Identification and isolation of DNA 
encoding the a 1B human calcium channel 
subunit 

DNA encoding the a 1B subunit was isolated by screening a 
human basal ganglia cDNA library with fragments of the rabbit 
skeletal muscle calcium channel a x subunit -encoding cDNA. A 
portion of one of the positive clones was used to screen an 
IMR32 cell cDNA library. Clones that hybridized to the basal 
ganglia DNA probe were used to further screen an IMR32 cell 
cDNA library to identify overlapping clones that in turn were 
used to screen a human hippocampus cDNA library. In this way, 
a sufficient series of clones to span nearly the entire length 
of the nucleotide sequence encoding the human a 1B subunit was 
obtained. Nucleic acid amplification of specific regions of 
the IMR32 cell a 1B mRNA yielded additional segments of the o; 1B 
coding sequence . 

A full-length a 1B DNA clone was constructed by ligating 
portions of the partial cDNA clones as described in Example 
II. C. SEQ ID Nos. 7 and 8 show the nucleotide sequences of 
DNA clones encoding the a 1B subunit as well as the deduced 
amino acid sequences. The a lB subunit encoded by SEQ ID No. 
7 is referred to as the a 1B .! subunit to distinguish it from 
another a 1B subunit, a 1B _ 2 , encoded by the nucleotide sequence 
shown as SEQ ID No. 8, which is derived from alternative 
splicing of the ar 1B subunit transcript. 

Nucleic acid amplification of IMR32 cell mRNA using 
oligonucleotide primers designed according to nucleotide 
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sequences within the a^.j-encoding DNA has identified variants 
of the a 1B transcript that appear to be splice variants because 
they contain divergent coding sequences . 

Identification and isolation of DNA 
encoding the a lc human calcium channel 
sub unit 

Numerous or lc - specif ic DNA clones were isolated. 
Characterization of the sequence revealed the a lc coding 
sequence, the a ic initiation of translation sequence, and an 
alternatively spliced region of a lc . Alternatively spliced 
variants of the or lc subunit have been identified. SEQ ID No. 
3 sets forth DNA encoding a substantial protion of an a ic 
subunit. The DNA sequences set forth in SEQ ID No. 4 and No. 
5 encode two possible amino terminal ends of the a lc protein. 
SEQ ID No. 6 encodes an alternative exon for the IV S3 
transmembrane domain. The sequences of substantial 

portions of two a lc splice variants, designated and a ic . 2 , 

are set forth in SEQ ID NOs. 3 and 36, respectively. 

The isolation and identification of DNA clones encoding 
portions of the a ic subunit is described in detail in Example 
II . 

Identification and isolation of DNA 
encoding the a 1E human calcium channel 
subunit 

DNA encoding a 1E human calcium channel subunits have been 
isolated from an oligo dT-primed human hippocampus library. 
The resulting clones, which are splice variants, were 
designated and a 1E _ 3 . The subunit designated has the 

amino acid sequence set forth in SEQ ID No. 24, and a subunit 
designated a 1E . 3 has the amino acid sequence set forth in SEQ 
ID No. 25. These splice variants differ by virtue of a 57 base 
pair insert between nucleotides 2405 and 2406 of SEQ. ID No. 
24 . 

The a 1E subunits provided herein appear to participate in 
the formation of calcium channels that have properties of 
high-voltage activated calcium channels and low-voltage 
activated channels. These channels are rapidly inactivating 
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compared to other high voltage-activated calcium channels. 
In addition these channels exhibit pharmacological profiles 
that are similar to voltage-activated channels, but are also 
sensitive to DHPs and u-Aga-IVA, which block certain high 
voltage activated channels. Additional details regarding the 
electrophysiology and pharmacology of channels containing a 1E 
subunits is provided in Example VII. F. 

Identification and isolation of DNA 
encoding encoding additional a 2 human 
calcium channel sub unit types and 
subtypes 

DNA encoding additional cn 1 subunits can be isolated and 
identified using the DNA provided herein as described for the 
"ia/ c*ib> °tic <*id and or 1E subunits or using other methods known 
to those of skill in the art. In particular, the DNA provided 
herein may be used to screen appropriate libraries to isolate 
related DNA. Full-length clones can be constructed using 
methods, such as those described herein, and the resulting 
subunits characterized by comparison of their sequences and 
electrophysiological and pharmacological properties with the 
subunits exemplified herein. 

Identification and isolation of DNA encoding 0 
human calcium channel subunits 
DNA encoding /3 l 

To isolate DNA encoding the /Jj subunit, a human 
hippocampus cDNA library was screened by hybridization to a 
DNA fragment encoding a rabbit skeletal muscle calcium channel 
/8 subunit. A hybridizing clone was selected and was in turn 
used to isolate overlapping clones until the overlapping 
clones encompassing DNA encoding the entire the human calcium 
channel p subunit were isolated and sequenced. 

Five alternatively spliced forms of the human calcium 
channel /3 1 subunit have been identified and DNA encoding a 
number of forms have been isolated. These forms are 
designated fi lmi , expressed in skeletal muscle, p^, expressed 
in the CNS, also expressed in the in the CNS, p lmi , 

expressed in aorta tissue and HEK 293 cells, and P^. s , 
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expressed in HEK 293 cells. Full-length DNA clones encoding 
the 0^ and 0^3 subunits have been constructed. The subunits 
0 . Pi-4 and Pis have been identified by nucleic acid 

amplification analysis as alternatively spliced forms of the 
0 subunit. Sequences of the & splice variants are set forth 
in SEQ ID Nos. 9, 10 and 33-35. 
DNA encoding 0 2 

DNA encoding the 0 2 splice variants has been obtained. 
These splice variants include 0 2C -0 2E . Splice variants 0 2C -0 2E 
include all of sequence set forth in SEQ ID No. 26, except for 
the portion at the 5' end (up to nucleotide 182), which 
differs among splice variants. The sequence set forth in SEQ 
ID No. 26 encodes 0 2O . Additional splice variants may be 
isolated using the methods described herein and 
oligonucleotides including all or portions of the DNA set 
forth in SEQ ID. No. 26 or may be prepared or obtained as 
described in the Examples. The sequences of 0 2 splice 
variants 0 2C and 0 2E are set forth in SEQ ID Nos. 3 7 and 38, 
respectively. 

DNA encoding 0 3 

DNA encoding the 0 3 subunit and any splice variants 
thereof may be isolated by screening a library, as described 
above for the 0, subunit, using DNA probes prepared according 
to SEQ ID Nos . 19. 20 or using all or a portion of the 
deposited 0 3 clone plasmid 01.42 (ATCC Accession No. 69048). 

The E. coli host containing plasmid 01.42 that includes 
DNA encoding a 0 3 subunit has been deposited as ATCC Accession 
No. 69048 in the American Type Culture Collection, 12301 
Parklawn Drive, Rockville, Maryland 20852 U.S.A. under the 
terms of the Budapest Treaty on the International Recognition 
of Deposits of Microorganisms for Purposes of Patent Procedure 
and the Regulations promulgated under this Treaty. Samples of 
the deposited material are and will be available to industrial 
property offices and other persons legally entitled to receive 
them under the terms of the Treaty and Regulations and 
otherwise in compliance with the patent laws and regulations 
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of the united States of America and all other nations or 
international organizations in which this application, or an 
application claiming priority of this application, is filed or 
in which any patent granted on any such application is 
granted . 

The 03 encoding plasmid is designated 01.42. The plasmid 
contains a 2 . 5 kb EcoRI fragment encoding 0 3 inserted into 
vector pGem 72 F( + ) and has been deposited in E. coli host 
strain DH5a. The sequences of 0 3 splice variants, designated 

and 0 3 . 2 are set forth in SEQ ID Nos . 19 and 20 
respectively. 

J^ ifi ? a ^ i0n t nd i8 °lati°* of DNA encoding the o2 
human calcium channel sub unit 

DNA encoding a human neuronal calcium channel a 2 subunit 
was isolated in a manner substantially similar to that used 
for isolating DNA encoding an a, subunit, except that a human 
genomic DNA library was probed under low and high stringency 
conditions with a fragment of DNA encoding the rabbit skeletal 
muscle calcium channel a 2 subunit. The fragment included 
nucleotides having a sequence corresponding to the nucleotide 
sequence between nucleotides 43 and 272 inclusive of rabbit 
back skeletal muscle calcium channel a 2 subunit cDNA as 
disclosed in PCT International Patent Application Publication 
No. WO 89/09834, which corresponds to U.S. Application Serial 
No. 07/620,520 (now allowed U.S. Application Serial No 
07/914,231), which is a continuation- in-part of United States 
Serial No. 176,899, filed April 4, 1988. 

Example IV describes the isolation of DNA clones encoding 
a 2 subunits of a human calcium channel from a human DNA 
library using genomic DNA and cDNA clones, identified by 
hybridization to the genomic DNA, as probes. 

SEQ ID Nos. li and 29-32 show the sequence of DNA 
encoding a 2 subunits. As described in Example V, nucleic acid 
amplification analysis of RNA from human skeletal muscle 
brain tissue and aorta using oligonucleotide primers specific 
for a region of the human neuronal a 2 subunit cDNA that 
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diverges from the rabbit skeletal muscle calcium channel a 2 
subunit cDNA identified splice variants of the human calcium 
channel or 2 subunit transcript . 

Identification and isolation of DNA encoding 7 

human calcium channel sub units 

DNA encoding a portion of a human neuronal calcium 
channel 7 subunit has been isolated as described in detail in 
Example VI. SEQ ID No. 14 shows the nucleotide sequence at 
the 3' -end of this DNA which includes a reading frame encoding 
a sequence of 43 amino acid residues. Since the portion that 
has been obtained is homologous to the rabbit clone, described 
in allowed co-owned U.S. Application Serial No. 07/482,384, 
the remainder of the clone can be obtained using routine 
methods . 
Antibodies 

Antibodies, monoclonal or polyclonal, specific for 
calcium channel subunit subtypes or for calcium channel types 
can be prepared employing standard techniques, known to those 
of skill in the art, using the subunit proteins or portions 
thereof as antigens. Anti-peptide and anti-fusion protein 
antibodies can be used [see, for example, Bahouth et al . 
(1991) Trends Pharmacol. Sci . 12:338-343; Current Protocols in 
Molecular Biology (Ausubel et al . , eds . ) John Wiley and Sons, 
New York (1984)] . Factors to consider in selecting portions 
of the calcium channel subunits for use as immunogens (as 
either a synthetic peptide or a recombinantly produced 
bacterial fusion protein) include antigenicity accessibility 
(i.e., extracellular and cytoplasmic domains), uniqueness to 
the particular subunit, and other factors known to those of 
skill in this art. 

The availability of subunit-specif ic antibodies makes 
possible the application of the technique of 
immunohistochemistry to monitor the distribution and 
expression density of various subunits ( e.g. , in normal vs 
diseased brain tissue) . Such antibodies could also be 
employed in diagnostic, such as LES diagnosis, and therapeutic 
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applications, such as using antibodies that modulate 
activities of calcium channels. 

The antibodies can be administered to a subject employing 
standard methods, such as, for example, by intraperitoneal, 
intramuscular, intravenous, or subcutaneous injection, implant 
or transdermal modes of administration, and the like. One of 
skill in the art can empirically determine dose forms, 
treatment regiments, etc., depending on the mode of 
administration employed. 

Subunit-specif ic monoclonal antibodies and polyclonal 
antisera have been prepared. The regions from which the 
antigens were identified by comparing the DNA and amino acid 
sequences of all known or or /? subunit subtypes. Regions of 
least homology, preferably human-derived sequences were 
selected. The selected regions or fusion proteins containing 
the selected regions are used as immunogens . Hydrophobicity 
analyses of residues in selected protein regions and fusion 
proteins are also performed; regions of high hydrophobicity 
are avoided. Also, and more importantly, when preparing 
fusion proteins in bacterial hosts, rare codons are avoided. 
In particular, inclusion of 3 or more successive rare codons 
in a selected host is avoided. Numerous antibodies, 
polyclonal and monoclonal, specific for a or /? subunits types 
or subtypes have been prepared; some of these are listed in 
the following Table. Exemplary antibodies and peptide antigens 
used to prepare the antibodies are set forth in the following 
Table : 



TABLE 3 



|| SPECIFICITY 


AMINO ACID 
NUMBER 


ANTIGEN NAME 


ANTIBODY TYPE 


orl generic 


112-140 


peptide 1A#1 


polyclonal 


al generic 


1420-1447 


peptide 1A#2 


polyclonal 


ckIA generic 


1048-1208 


alA#2 (b) GST fusion" 


polyclonal 
monoclonal 


orlB generic 


983-1106 


alB#2 (b) GST fusion 


polyclonal 
monoclonal 
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<ylB-l#3 GST fusion 



g!B-2#4 GST fusion 



polyclonal 



polyclonal 



alE generic 



«1E#2 (a) GST fusion 



polyclonal 



GST gene fusion system is available from Pharmacia; see also. Smith 
et al (1988) Gene 67:31. The system provides pGEX plasmids that are 
designed for inducible, high-level expression of genes or gene 
fragments as fusions with Schistosoma japonicum GST. Upon expression 
in a bacterial host, the resulting fusion proteins are purified from 
bacterial lysates by affinity chromatography. 

The GST fusion proteins are each specific for the 
cytoplasmic loop region IIS6-IIS1, which is a region of low 
subtype homology for all subtypes, including Qf ic and a 1D , for 
which similar fusions and antisera can be prepared. 
Preparation of recombinant eukaryotic cells containing DNA 
encoding heterologous calcium channel subunits 

DNA encoding one or more of the calcium channel subunits 
or a portion of a calcium channel subunit may be introduced 
into a host cell for expression or replication of the DNA. 
Such DNA may be introduced using methods described in the 
following examples or using other procedures well known to 
those skilled in the art. Incorporation of cloned DNA into a 
suitable expression vector, transfection of eukaryotic cells 
with a plasmid vector or a combination of plasmid vectors, 
each encoding one or more distinct genes or with linear DNA, 
and selection of transfected cells are also well known in the 
art [see, e.g., Sambrook et al . (1989) Molecular Cloning: A 
Laboratory Manual, Second Edition, Cold Spring Harbor 
Laboratory Press] . Cloned full-length DNA encoding any of 
the subunits of a human calcium channel may be introduced into 
a plasmid vector for expression in a eukaryotic cell. Such 
DNA may be genomic DNA or cDNA. Host cells may be transfected 
with one or a combination of the plasmids, each of which 
encodes at least one calcium channel subunit. Alternatively, 
host cells may be transfected with linear DNA using methods 
well known to those of skill in the art. 

While the DNA provided herein may be expressed in any 
eukaryotic cell, including yeast cells such as P. pastoris 
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[see, e.g., Cregg et al . (1987) Bio/Technology 5:479], 
mammalian expression systems for expression of the DNA 
encoding the human calcium channel subunits provided herein 
are preferred. 

The heterologous DNA may be introduced by any method 
known to those of skill in the art, such as transfection with 
a vector encoding the heterologous DNA. Particularly preferred 
vectors for transfection of mammalian cells are the pSV2dhfr 
expression vectors, which contain the SV40 early promoter, 
mouse dhfr gene, SV40 polyadenylation and splice sites and 
sequences necessary for maintaining the vector in bacteria, 
cytomegalovirus (CMV) promoter -based vectors such as pCDNAl , 
or pc DNA -amp and MMTV promoter-based vectors. DNA encoding 
the human calcium channel subunits has been inserted in the 
vector pCDNAl at a position immediately following the CMV 
promoter. The vector pCDNAl is presently preferred. 

Stably or transiently transfected mammalian cells may be 
prepared by methods known in the art by transfecting cells 
with an expression vector having a selectable marker gene such 
as the gene for thymidine kinase, dihydrof olate reductase, 
neomycin resistance or the like, and, for transient 
transfection, growing the transfected cells under conditions 
selective for cells expressing the marker gene. Functional 
voltage -dependent calcium channels have been produced in HEK 
2 93 cells transfected with a derivative of the vector pCDNAl 
that contains DNA encoding a human calcium channel subunit. 

The heterologous DNA may be maintained in the cell as an 
episomal element or may be integrated into chromosomal DNA of 
the cell. The resulting recombinant cells may then be 
cultured or subcultured (or passaged, in the case of mammalian 
cells) from such a culture or a subculture thereof. Methods 
for transfection, injection and culturing recombinant cells 
are known to the skilled artisan. Eukaryotic cells in which 
DNA or RNA may be introduced, include any cells that are 
transferable by such DNA or RNA or into which such DNA may be 
injected. Virtually any eukaryotic cell can serve as a 
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vehicle for heterologous DNA. Preferred cells are those that 
can also express the DNA and RNA and most preferred cells are 
those that can form recombinant or heterologous calcium 
channels that include one or more subunits encoded by the 
heterologous DNA. Such cells may be identified empirically or 
selected from among those known to be readily transfected or 
injected. Preferred cells for introducing DNA include those 
that can be transiently or stably transfected and include, but 
are not limited to, cells of mammalian origin, such as COS 
cells, mouse L cells, CHO cells, human embryonic kidney cells, 
African green monkey cells and other such cells known to those 
of skill in the art, amphibian cells, such as Xenopus laevis 
oocytes, or those of yeast such as Saccharomyces cerevisiae or 
Pichia pastoris. Preferred cells for expressing injected RNA 
transcripts or cDNA include Xenopus laevis oocytes. Cells 
that are preferred for transfection of DNA are those that can 
be readily and efficiently transfected. Such cells are known 
to those of skill in the art or may be empirically identified. 
Preferred cells include DG44 cells and HEK 293 cells, 
particularly HEK 293 cells that can be frozen in liquid 
nitrogen and then thawed and regrown. Such HEK 293 cells are 
described, for example in U.S. Patent No. 5,024,939 to Gorman 
[see, also Stillman et al. (1985) Mol . Cell. Biol. 5:2051- 
2060] . 

The cells may be used as vehicles for replicating 
heterologous DNA introduced therein or for expressing the 
heterologous DNA introduced therein. In certain embodiments, 
the cells are used as vehicles for expressing the heterologous 
DNA as a means to produce substantially pure human calcium 
channel subunits or heterologous calcium channels. Host cells 
containing the heterologous DNA may be cultured under 
conditions whereby the calcium channels are expressed. The 
calcium channel subunits may be purified using protein 
purification methods known to those of skill in the art. For 
example, antibodies, such as those provided herein, that 
specifically bind to one or more of the subunits may be used 
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for affinity purification of the subunit or calcium channels 
containing the subunits. 

Substantially pure subunits of a human calcium channel 
oi 1 subunits of a human calcium channel, ot 2 subunits of a human 
calcium channel, (3 subunits of a human calcium channel and y 
subunits of a human calcium channel are provided. 
Substantially pure isolated calcium channels that contain at 
least one of the human calcium channel subunits are also 
provided. Substantially pure calcium channels that contain a 
mixture of one or more subunits encoded by the host cell and 
one or more subunits encoded by heterologous DNA or RNA that 
has been introduced into the cell are also provided. 
Substantially pure subtype- or tissue-type specific calcium 
channels are also provided. 

In other embodiments, eukaryotic cells that contain 
heterologous DNA encoding at least one of an a, subunit of a 
human calcium channel , an a 2 subunit of a human calcium 
channel, a /3 subunit of a human calcium channel and a y 
subunit of a human calcium channel are provided. in 
accordance with one preferred embodiment, the heterologous DNA 
is expressed in the eukaryotic cell and preferably encodes a 
human calcium channel otj subunit. 

Expression of heterologous calcium channels- 
electrophysiology and pharmacology 

Electrophysiological methods for measuring calcium 
channel activity are kwown to those of skill in the art and 
are exemplified herein. Any such methods may be used in order 
to detect the formation of functional calcium channels and to 
characterize the kinetics and other characteristics of the 
resulting currents. Pharmacological studies may be combined 
with the electrophysiological measurements in order to further 
characterize the calcium channels. 

With respect to measurement of the activity of functional 
heterologous calcium channels, preferably, endogenous ion 
channel activity and, if desired, heterologous channel 
activity of channels that do not contain the desired subunits, 
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of a host cell can be inhibited to a significant extent by 
chemical, pharmacological and electrophysiological means, 
including the use of differential holding potential, to 
increase the S/N ratio of the measured heterologous calcium 
channel activity. 

Thus, various combinations of subunits encoded by the DNA 
provided herein are introduced into eukaryotic cells. The 
resulting cells can be examined to ascertain whether 
functional channels are expressed and to determine the 
properties of the channels. In particularly preferred 
aspects, the eukaryotic cell which contains the heterologous 
DNA expresses it and forms a recombinant functional calcium 
channel activity. In more preferred aspects, the recombinant 
calcium channel activity is readily detectable because it is 
a type that is absent from the untransf ected host cell or is 
of a magnitude and/or pharmacological properties or exhibits 
biophysical properties not exhibited in the untransf ected 
cell. 

The eukaryotic cells can be transfected with various 
combinations of the subunit subtypes provided herein. The 
resulting cells will provide a uniform population of calcium 
channels for study of calcium channel activity and for use in 
the drug screening assays provided herein. Experiments that 
have been performed have demonstrated the inadequacy of prior 
classification schemes. 

Preferred among transfected cells is a recombinant 
eukaryotic cell with a functional heterologous calcium 
channel . The recombinant cell can be produced by introduction 
of and expression of heterologous DNA or RNA transcripts 
encoding an a x subunit of a human calcium channel, more 
preferably also expressing, a heterologous DNA encoding a p 
subunit of a human calcium channel and/or heterologous DNA 
encoding an a 2 subunit of a human calcium channel. Especially 
preferred is the expression in such a recombinant cell of each 
of the a lf 0 and a 2 subunits encoded by such heterologous DNA 
or RNA transcripts, and optionally expression of heterologous 
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DNA or an RNA transcript encoding a y subunit of a human 
calcium channel. The functional calcium channels may 

preferably include at least an a, subunit and a /3 subunit of 
a human calcium channel. Eukaryotic cells expressing these 
two subunits and also cells expressing additional subunits, 
have been prepared by transfection of DNA and by injection of 
RNA transcripts. Such cells have exhibited voltage -dependent 
calcium channel activity attributable to calcium channels that 
contain one or more of the heterologous human calcium channel 
subunits. For example, eukaryotic cells expressing 

heterologous calcium channels containing an or, subunit in 
addition to the a x subunit and a /S subunit have been shown to 
exhibit increased calcium selective ion flow across the 
cellular membrane in response to depolarization, indicating 
that the a 2 subunit may potentiate calcium channel function. 
Cells that have been co-transf ected with increasing ratios of 
a 2 to ct 1 and the activity of the resulting calcium channels has 
been measured. The results indicate that a 2 increasing the 
amount of a 3 - encoding DNA relative to the other transfected 
subunits increases calcium channel activity. 

Eukaryotic cells which express heterologous calcium 
channels containing at least a human a, subunit, a human /3 
subunit and a human o; 2 subunit are preferred. Eukaryotic 
cells transformed with a composition containing cDNA or an RNA 
transcript that encodes an a, subunit alone or in combination 
with a p and/or an a 2 subunit may be used to produce cells 
that express functional calcium channels. Since recombinant 
cells expressing human calcium channels containing all of the 
human subunits encoded by the heterologous cDNA or RNA are 
especially preferred, it is desirable to inject or transfect 
such host cells with a sufficient concentration of the 
subunit -encoding nucleic acids to form calcium channels that 
contain the human subunits encoded by heterologous DNA or RNA. 
The precise amounts and ratios of DNA or RNA encoding the 
subunits may be empirically determined and optimized for a 
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particular combination of subunits, cells and assay 
conditions . 

In particular, mammalian cells have been transiently and 
stably tranfected with DNA encoding one or more human calcium 
channel subunits. Such cells express heterologous calcium 
channels that exhibit pharmacological and electrophysiological 
properties that can be ascribed to human calcium channels. 
Such cells, however, represent homogeneous populations and the 
pharmacological and electrophysiological data provides 
insights into human calcium channel activity heretofore 
unattainable. For example, HEK cells that have been 
transiently transfected with DNA encoding the or XB -i, a 2 b» and 0 1O 
subunits. The resulting cells transiently express these 
subunits, which form calcium channels that have properties 
that appear to be a pharmacologically distinct class of 
voltage-activated calcium channels distinct from those of L- , 
N-, T- and P-type channels. The observed or 1E currents were 
insensitive to drugs and toxins previously used to define 
other classes of voltage-activated calcium channels. 

HEK cells that have been transfiently transfected with 
DNA encoding a 1B . lf a Jb , and 0 a . 2 express heterologous calcium 
channels that exhibt sensitivity to w-conotoxin and currents 
typical of N-type channels. It has been found that alteration 
of the molar raios of a 1B . a , a 2b and fi lm2 introduced into the 
cells into to achieve equivalent mRNA levels significantly 
increased the number of receptors per cell, the current 
density, and affected the K d for w-conotoxin. 

The electrophyiological properties of these channels 
produced from «„. lf a 2b , and fi x . a was compared with those of 
channels produced by transiently transfecting HEK cells with 
DNA encoding a 1B . 1( ot 2b and ^. 3 . The channels exhibited similar 
voltage dependence of activation, substantially identical 
voltage dependence, similar kinetics of activation and tail 
currents that could be fit by a single exponential. The 
voltage dependence of the kinetics of inactivation was 
significantly different at all voltages examined. 
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In certain embodiments, the eukaryotic cell with a 
heterologous calcium channel is produced by introducing into 
the cell a first composition, which contains at least one RNA 
transcript that is translated in the cell into a subunit of a 
human calcium channel. In preferred embodiments, the subunits 
that are translated include an subunit of a human calcium 
channel. More preferably, the composition that is introduced 
contains an RNA transcript which encodes an subunit of a 
human calcium channel and also contains (l) an RNA transcript 
which encodes a fi subunit of a human calcium channel and/or 
(2) an RNA transcript which encodes an a 2 subunit of a human 
calcium channel. Especially preferred is the introduction of 
RNA encoding an « 14 a fi and an a 2 human calcium channel 
subunit, and, optionally, a y subunit of a human calcium 
channel. Methods for in vitro transcription of a cloned 

DNA and injection of the resulting RNA into eukaryotic cells 
are well known in the art. Transcripts of any of the full- 
length DNA encoding any of the subunits of a human calcium 
channel may be injected alone or in combination with other 
transcripts into eukaryotic cells for expression in the cells 
Amphibian oocytes are particularly preferred for expression of 
in vitro transcripts of the human calcium channel subunit cDNA 
clones provided herein. Amphibian oocytes that express 
functional heterologous calcium channels have been produced by 
this method. 

Assays and Clinical uses of the cells and calcium channels 
Assays 

Assays for identifying compounds that 
modulate calcium channel activity 

Among the uses for eukaryotic cells which recombinantly 

express one or more subunits are assays for determining 

whether a test compound has calcium channel agonist or 

antagonist activity. These eukaryotic cells may also be used 

to select from among known calcium channel agonists and 

antagonists those exhibiting a particular calcium channel 
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subtype specificity and to thereby select compounds that have 
potential as disease- or tissue-specific therapeutic agents. 

In vitro methods for identifying compounds, such as 
calcium channel agonist and antagonists, that modulate the 
activity of calcium channels using eukaryotic cells that 
express heterologous human calcium channels are provided. 

In particular, the assays use eukaryotic cells that 
express heterologous human calcium channel subunits encoded by 
heterologous DNA provided herein, for screening potential 
calcium channel agonists and antagonists which are specific 
for human calcium channels and particularly for screening for 
compounds that are specific for particular human calcium 
channel subtypes. Such assays may be used in conjunction with 
methods of rational drug design to select among agonists and 
antagonists, which differ slightly in structure, those 
particularly useful for modulating the activity of human 
calcium channels, and to design or select compounds that 
exhibit subtype- or tissue- specific calcium channel 
antagonist and agonist activities. These assays should 
accurately predict the relative therapeutic efficacy of a 
compound for the treatment of certain disorders in humans . In 
addition, since subtype-and tissue-specific calcium channel 
subunits are provided, cells with tissue- specific or subtype- 
specific recombinant calcium channels may be prepared and used 
in assays for identification of human calcium channel tissue- 
or subtype -specific drugs. 

Desirably, the host cell for the expression of calcium 
channel subunits does not produce endogenous calcium channel 
subunits of the type or in an amount that substantially 
interferes with the detection of heterologous calcium channel 
subunits in ligand binding assays or detection of heterologous 
calcium channel function, such as generation of calcium 
current, in functional assays. Also, the host cells 

preferably should not produce endogenous calcium channels 
which detectably interact with compounds having, at 
physiological concentrations (generally nanomolar or picomolar 
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concent rat ions), affinity for calcium channels that contain 
one or all of the human calcium channel subunits provided 
herein. 

With respect to ligand binding assays for identifying a 
compound which has affinity for calcium channels, cells are 
employed which express, preferably, at least a heterologous Bl 
subunit. Transfected eukaryotic cells which express at least 
an subunit may be used to determine the ability of a test 
compound to specifically bind to heterologous calcium channels 
by, for example, evaluating the ability of the test compound 
to inhibit the interaction of a labeled compound known to 
specifically interact with calcium channels. Such ligand 
binding assays may be performed on intact transfected cells or 
membranes prepared therefrom. 

The capacity of a test compound to bind to or otherwise 
interact with membranes that contain heterologous calcium 
channels or subunits thereof may be determined by using any 
appropriate method, such as competitive binding analysis, such 
as Scatchard plots, in which the binding capacity of such 
membranes is determined in the presence and absence of one or 
more concentrations of a compound having known affinity for 
the calcium channel. where necessary, the results may be 
compared to a control experiment designed in accordance with 
methods known to those of skill in the art. For example, as 
a negative control, the results may be compared to those of 
assays of an identically treated membrane preparation from 
host cells which have not been transfected with one or more 
subunit -encoding nucleic acids. 

The assays involve contacting the cell membrane of a 
recombinant eukaryotic cell which expresses at least one 
subunit of a human calcium channel, preferably at least an a x 
subunit of a human calcium channel, with a test compound and 
measuring the ability of the test compound to specifically 
bind to the membrane or alter or modulate the activity of a 
heterologous calcium channel on the membrane. 
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In preferred embodiments, the assay uses a recombinant 
cell that has a calcium channel containing an subunit of a 
human calcium channel in combination with a P subunit of a 
human calcium channel and/or an ot 2 subunit of a human calcium 
channel. Recombinant cells expressing heterologous calcium 
channels containing each of the a a , P and o! 2 human subunits, 
and, optionally, a 7 subunit of a human calcium channel are 
especially preferred for use in such assays. 

In certain embodiments, the assays for identifying 
compounds that modulate calcium channel activity are practiced 
by measuring the calcium channel activity of a eukaryotic cell 
having a heterologous, functional calcium channel when such 
cell is exposed to a solution containing the test compound and 
a calcium channel -selective ion and comparing the measured 
calcium channel activity to the calcium channel activity of 
the same cell or a substantially identical control cell in a 
solution not containing the test compound. The cell is 
maintained in a solution having a concentration of calcium 
channel -selective ions sufficient to provide an inward current 
when the channels open. Rcombinant cells expressing calcium 
channels that include each of the a z , P and a 2 human subunits, 
and, optionally, a y subunit of a human calcium channel, are 
especially preferred for use in such assays. Methods for 
practicing such assays are known to those of skill in the art. 
For example, for similar methods applied with Xenopus laevis 
oocytes and acetylcholine receptors, see, Mishina et al . 
[(1985) .Nature 323:364] and, with such oocytes and sodium 
channels [see, Noda et al . (1986) Wature 322:826-828]. For 
similar studies which have been carried out with the 
acetylcholine receptor, see, e.g., Claudio et al . [(1987) 
Science 235:1688-1694] . 

Functional recombinant or heterologous calcium channels 
may be identified by any method known to those of skill in the 
art. For example, electrophysiological procedures for 

measuring the current across an ion-selective membrane of a 
cell, which are well known, may be used. The amount and 
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duration of the flow of calcium-selective ions through 
heterologous calcium channels of a recombinant cell containing 
DNA encoding one or more of the subunits provided herein has 
been measured using electrophysiological recordings using a 
two electrode and the whole-cell patch clamp techniques. In 
order to improve the sensitivity of the assays, known methods 
can be used to eliminate or reduce non-calcium currents and 
calcium currents resulting from endogenous calcium channels, 
when measuring calcium currents through recombinant channels. 

For example, the DHP Bay K 8644 specifically enhances L-type 
calcium channel function by increasing the duration of the 
open state of the channels [see, e.g., Hess, J.B., et al. 
(1984) Nature 311:538-544] . Prolonged opening of the channels 
results in calcium currents of increased magnitude and 
duration. Tail currents can be observed upon repolarization 
of the cell membrane after activation of ion channels by a 
depolarizing voltage command. The opened channels require a 
finite time to close or "deactivate" upon repolarization, and 
the current that flows through the channels during this period 
is referred to as a tail current. Because Bay K 8644 prolongs 
opening events in calcium channels, it tends to prolong these 
tail currents and make them more pronounced. 

In practicing these assays, stably or transiently 
transfected cells or injected cells that express voltage- 
dependent human calcium channels containing one or more of the 
subunits of a human calcium channel desirably may be used in 
assays to identify agents, such as calcium channel agonists 
and antagonists, that modulate calcium channel activity. 
Functionally testing the activity of test compounds, including 
compounds having unknown activity, for calcium channel agonist 
or antagonist activity to determine if the test compound 
potentiates, inhibits or otherwise alters the flow of calcium 
ions or other ions through a human calcium channel can be 
accomplished by (a) maintaining a eukaryotic cell which is 
transfected or injected to express a heterologous functional 
calcium channel capable of regulating the flow of calcium 
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channel -selective ions into the cell in a medium containing 
calcium channel -selective ions (i) in the presence of and (ii) 
in the absence of a test compound; (b) maintaining the cell 
under conditions such that the heterologous calcium channels 
are substantially closed and endogenous calcium channels of 
the cell are substantially inhibited (c) depolarizing the : 
membrane of the cell maintained in step (b) to an extent and 
for an amount of time sufficient to cause (preferably, 
substantially only) the heterologous calcium channels to 
become permeable to the calcium channel -selective ions; and 
(d) comparing the amount and duration of current flow into the 
cell in the presence of the test compound to that of the 
current flow into the cell, or a substantially similar cell, 
in the absence of the test compound. 

The assays thus use cells, provided herein, that express 
heterologous functional calcium channels and measure 
functionally, such as electrophysiologically, the ability of 
a test compound to potentiate, antagonize or otherwise 
modulate the magnitude and duration of the flow of calcium 
channel -selective ions, such as Ca" or Ba", through the 
heterologous functional channel. The amount of current which 
flows through the recombinant calcium channels of a cell may 
be determined directly, such as electrophysiologically, or by 
monitoring an independent reaction which occurs 
intracellularly and which is directly influenced in a calcium 
(or other) ion dependent manner. Any method for assessing 

the activity of a calcium channel may be used in conjunction 
with the cells and assays provided herein. For example, 

in one embodiment of the method for testing a compound for its 
ability to modulate calcium channel activity, the amount of 
current is measured by its modulation of a reaction which is 
sensitive to calcium channel -selective ions and uses a 
eukaryotic cell which expresses a heterologous calcium channel 
and also contains a transcriptional control element 
operatively linked for expression to a structural gene that 
encodes an indicator protein. The transcriptional control 
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element used for transcription of the indicator gene is 
responsive in the cell to a calcium channel -selective ion, 
such as Ca 2 * and Ba*. The details of such transcriptional 
based assays are described in commonly owned PCT International 
Patent Application No. PCT/US91/5625 , filed August 7, 1991, 
which claims priority to copending commonly owned allowed U.S. 
Application Serial No. 07/563,751, filed August 7, 1990; see 
also, commonly owned published PCT International Patent 
Application PCT US92/11090, which corresponds to co-pending 
U.S. Applications Serial Nos. 08/229,150 and 08/244,985. 
Assays for diagnosis of LES 

LES is an autoimmune disease characterized by an 
insufficient release of acetylcholine from motor nerve 
terminals which normally are responsive to nerve impulses. 
Immunoglobulins (IgG) from LES patients block individual 
voltage -dependent calcium channels and thus inhibit calcium 
channel activity [Kim and Neher, Science 239:405-408 (1988)] . 
A diagnostic assay for Lambert Eaton Syndrome (LES) is 
provided herein. The diagnostic assay for LES relies on the 
immunological reactivity of LES IgG with the human calcium 
channels or particular subunits alone or in combination or 
expressed on the surface of recombinant cells. For example, 
such an assay may be based on immunoprecipitation of LES IgG 
by the human calcium channel subunits and cells that express 
such subunits provided herein. 

Clinical applications 

In relation to therapeutic treatment of various disease 
states, the availability of DNA encoding human calcium channel 
subunits permits identification of any alterations in such 
genes ( e.g. . mutations) which may correlate with the 
occurrence of certain disease states. In addition, the 
creation of animal models of such disease states becomes 
possible, by epecif ically introducing such mutations into 
synthetic DNA fragments can then be introduced into laboratory 
animals or in vitro assay systems to determine the effects 
thereof . 
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Also, genetic screening can be carried out using the 
nucleotide sequences as probes. Thus, nucleic acid samples 
from subjects having pathological conditions suspected of 
involving alteration/modification of any one or more of the 
calcium channel subunits can be screened with appropriate 
probes to determine if any abnormalities exist with respect to 
any of the endogenous calcium channels. Similarly, subjects 
having a family history of disease states related to calcium 
channel dysfunction can be screened to determine if they are 
also predisposed to such disease states. 

EXAMPLES 

The following examples are included for illustrative 
purposes only and are not intended to limit the scope of the 
invention. 

EXAMPLE I: PREPARATION OF LIBRARIES USED FOR ISOLATION 

OF DNA ENCODING HUMAN NEURONAL VOLTAGE - 
DEPENDENT CALCIUM CHANNEL SUBUNITS 

A. RNA Isolation 

1. IMR32 cells 

IMR32 cells were obtained from the American Type Culture 
Collection (ATCC Accession No. CCL127, Rockville, MD) and 
grown in DMEM, 10% fetal bovine serum, 1% 
penicillin/streptomycin (GIBCO, Grand Island, NY) plus l.OmM 
dibutyryl cAMP (dbcAMP) for ten days. Total RNA was isolated 
from the cells according to the procedure described by H.C. 
Birnboim [(1988) Nucleic Acids Research 2£: 1487-1497] . 
Poly (A*) RNA was selected according to standard procedures 
[see, e.g., Sambrook et al. (1989) Molecular Cloning, A 
Laboratory Manual, Cold Spring Harbor Laboratory Press; pg . 
7.26-7.29] . 

2. Human thalamus tissue 

Human thalamus tissue (2.34 g) , obtained from the 
National Neurological Research Bank, Los Angeles, CA, that had 
been stored frozen at -70°C was pulverized using a mortar and 
pestle in the presence of liquid nitrogen and the cells were 
lysed in 12 ml of lysis buffer (5 M guanidinium 
isothiocyanate, 50 mM TRIS, pH 7.4, 10 mM EDTA, 5% 0- 
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mercaptoethanol ) . Lysis buffer was added to the lysate to 
yield a final volume of 17 ml. N-laurylsarcosine and CsCl 
were added to the mixture to yield final concentrations of 4% 
and 0.01 g/ml, respectively, in a final volume of 18 ml. 

The sample was centrifuged at 9,000 rpm in a Sorvall SS34 
rotor for 10 min at room temperature to remove the insoluble 
material as a pellet. The supernatant was divided into two 
equal portions and each was layered onto a 2 -ml cushion of a 
solution of 5.7 M csci, 0.1 M EDTA contained in separate 
centrifuge tubes to yield approximately 9 ml per tube. The 
samples were centrifuged in an SW41 rotor at 37,000 rpm for 24 
h at 20°C. 

After centrifugation, each RNA pellet was resuspended in 
3 ml ETS (10 mM TRIS, pH 7.4, 10 mM EDTA, 0.2% SDS) and 
combined into a single tube. The RNA was precipitated with 
0.25 M NaCl and two volumes of 95% ethanol. 

The precipitate was collected by centrifugation and 
resuspended in 4 ml PK buffer (0.05 M TRIS, pH 8.4, 0.14 M 
NaCl, 0.01 M EDTA, 1% SDS). Proteinase K was added to the 
sample to a final concentration of 200 ng/ml . The sample was 
incubated at 22 °C for l h, followed by extraction with an 
equal volume of phenol: chloroform : isoamyl alcohol (50:48:2) two 
times, followed by one extraction with an equal volume of 
chloroform: isoamyl alcohol (24:1). The RNA was precipitated 
with ethanol and NaCl. The precipitate was resuspended in 400 
Ml of ETS buffer. The yield of total RNA was approximately 
l.O mg. Poly A* RNA (30 /zg) was isolated from the total RNA 
according to standard methods as stated in Example I.A.I. 

B. Library Construction 

Double -stranded cDNA was synthesized according to 
standard methods [see, e.g., Sambrook et al . (1989) IN: 
Molecular Cloning, A Laboratory Manual, Cold Spring Harbor 
Laboratory Press, Chapter 8] . Each library was prepared in 
substantially the same manner except for differences in: 
1) the oligonucleotide used to prime the first strand cDNA 
synthesis, 2) the adapters that were attached to the double- 
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stranded cDNA, 3) the method used to remove the free or unused 
adapters, and 4) the size of the fractionated cDNA ligated 
into the X phage vector. 

1. IMR32 cDNA library #1 
Single -stranded cDNA was synthesized using IMR32 poly (A*) 
RNA (Example I.A.I.) as a template and was primed using oligo 
(dT) 12 _ 18 (Collaborative Research Inc., Bedford, MA): The 
single -stranded cDNA was converted to double -stranded cDNA and 
the yield was approximately 2 fig. Ecol adapters: 

5 ' - AATTCGGTACGTACACTCGAGC - 3 ' = 2 2 -mer ( SEQ ID No . 1 5 ) 
3'- GCCATGCATGTGAGCTCG- 5 ' = 18 -mer (SEQ ID No. 16) 
also containing SnaBI and Xhol restriction sites were then 
added to the double -stranded cDNA according to the following 
procedure . 

a. Phosphorylation of 18 -mer 

The 18 -mer was phosphorylated using standard methods 
[see, e.g., Sambrook et al. (1989) IN: Molecular Cloning, A 
Laboratory Manual, Cold Spring Harbor Laboratory Press, 
Chapter 8] by combining in a 10 /xl total volume the 18 -mer 
(225 pmoles) with [ 32 P] -y-ATP (7000 Ci/mmole; 1.0 /xl) and kinase 
(2 U) and incubating at 37° C for 15 minutes. After 
incubation, 1 fil 10 mM ATP and an additional 2 U of kinase 
were added and incubated at 37°C for 15 minutes. 
Kinase was then inactivated by boiling for 10 minutes. 

b. Hybridization of 22 -mer 

The 22 -mer was hybridized to the phosphorylated 18 -mer 
by addition of 225 pmoles of the 22 -mer (plus water to bring 
volume to 15 ill), and incubation at 65°C for 5 minutes. The 
reaction was then allowed to slow cool to room temperature. 

The adapters were thus present at a concentration of 15 
pmoles/Ml, and were ready for cDNA-adapter ligation. 

c. Ligation of adapters to cDNA 

After the EcoRI, SnaBI, Xhol adapters were ligated to the 
double- stranded cDNA using a standard protocol [see, e.g., 
Sambrook et al . (1989) IN: Molecular Cloning, A Laboratory 
Manual, Cold Spring Harbor Laboratory Press, Chapter 8] , the 
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ligase was inactivated by heating the mixture to 72°C for 15 
minutes. The following reagents were added to the cDNA 
ligation reaction and heated at 37°C for 3 0 minutes: cDNA 
ligation reaction (20 nl) , water (24 fil) , lOx kinase buffer (3 
Ml), 10 mM ATP (1 pi) and kinase (2/il of 2 V/fil). The 
reaction was stopped by the addition of 2 /il 0.5M EDTA, 
followed by one phenol /chloroform extraction and one 
chloroform extraction. 

d. Size Selection and Packaging of cDNA 

The double -stranded cDNA with the EcdRI, SnaBI, Xhol 
adapters ligated was purified away from the free or unligated 
adapters using a 5 ml Sepharose CL-4B column (Sigma, St. 
Louis, MO) . 100 fil fractions were collected and those 
containing the cDNA, determined by monitoring the 
radioactivity, were pooled, ethanol precipitated, resuspended 
in TE buffer and loaded onto a 1% agarose gel . After the 
electrophoresis, the gel was stained with ethidium bromide and 
the 1 to 3 kb fraction was cut from the gel . The cDNA 
embedded in the agarose was eluted using the "Geneluter 
Electroelution System" (Invitrogen, San Diego, CA) . The 
eluted cDNA was collected by ethanol precipitation and 
resuspended in TE buffer at 0.10 pmol/nl. The cDNA was 
ligated to 1 jxg of EcoRI digested, dephosphorylated Xgtll in 
a 5 (il reaction volume at a 2- to 4- fold molar excess ratio 
of cDNA over the Xgtll vector. The ligated Xgtll containing 
the cDNA insert was packaged into X phage virions in vitro 
using the Gigapack (Stratagene, La Jolla, CA) kit. The 
packaged phage were plated on an E. coli Y1088 bacterial lawn 
in preparation for screening. 

2. IMR32 cDNA library #2 

This library was prepared as described (Example I.B.I.) 
with the exception that 3 to 9 kb cDNA fragments were ligated 
into the Xgtll phage vector rather than the 1 to 3 kb 
fragments . 
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3. IMR32 cDNA library #3 

IMR3 2 cell poly (A*) RNA (Example I.A.I.) was used as a 
template to synthesize single-stranded cDNA. The primers for 
the first strand cDNA synthesis were random primers 
(hexadeoxy-nucleotides [pd(N) s ] Cat #5020-1, Clontech, Palo 
Alto, CA) . The double -stranded cDNA was synthesized, EcoRI, 
SnaBI, Xhol adapters were added to the cDNA, the unligated 
adapters were removed, and the double -stranded cDNA with the 
ligated adapters was fractionated on an agarose gel, as 
described in Example I.B.I. The cDNA fraction greater than 

I. 8 kb was eluted from the agarose, ligated into Xgtli, 
packaged, and plated into a bacterial lawn of Y108 8 (as 
described in Example I.B.I.) . 

4. IMR32 CDNA library #4 

IMR32 cell poly (A*) RNA (Example I.A.I.) was used as a 
template to synthesize single-stranded cDNA. The primers for 
the first strand cDNA synthesis were oligonucleotides: 89-365a 
specific for the a 1D (VDCC III) type Oj-subunit (see Example 

II. A.) coding sequence (the complementary sequence of nt 2927 
to 2956, SEQ ID No. 1), 89-495 specific for the a lc (VDCC II) 
type c^-subunit (see Example II. B.) coding sequence (the 
complementary sequence of nt 852 to 873, SEQ ID No. 3), and 
90-12 specific for the £Y lc -subunit coding sequence (the 
complementary sequence of nt 2496 to 2520, SEQ ID No. 3) . The 
cDNA library was then constructed as described (Example 
I.B.3) , except that the cDNA size -fraction greater than 1.5 kb 
was eluted from the agarose rather than the greater than 
1.8 kb fraction. 

5. IMR32 cDNA library #5 

The cDNA library was constructed as described (Example 
I.B.3.) with the exception that the size-fraction greater than 
1.2 kb was eluted from the agarose rather than the greater 
than 1.8 kb fraction. 

6. Human thalamus cDNA library #6 

Human thalamus poly (A + ) RNA (Example I. A. 2.) was used as 
a template to synthesize single-stranded cDNA . Oligo (dT) was 
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used to prime the first strand synthesis (Example I.B.I.) . 
The double- stranded cDNA was synthesized (Example I.B.I.) and 
£coRI, Kprxl, Ncol adapters of the following sequence: 
5' CCATGGTACCTTCGTTGACG 3' = 20-mer (SEQ ID NO. 17) 
3' GGTACCATGGAAGCAACTGCTTAA 5'= 24-mer (SEQ ID NO. 18) 
were ligated to the double- stranded cDNA as described (Example 
I.B.I.) with the 20-mer replacing the 18-mer and the 24-mer 
replacing the 22-mer. The unligated adapters were removed by 
passing the cDNA-adapter mixture through a 1 ml Bio Gel A-50 
(Bio-Rad Laboratories, Richmond, CA) column. Fractions (30 
Ml) were collected and 1 fil of each fraction in the first peak 
of radioactivity was electrophoresed on a 1% agarose gel . 
After electrophoresis, the gel was dried on a vacuum gel drier 
and exposed to x-ray film. The fractions containing cDNA 
fragments greater than 600 bp were pooled, ethanol 
precipitated, and ligated into Xgtll (Example I.B.I.) . The 
construction of the cDNA library was completed as described 
(Example I.B.I. ) . 

C. Hybridization and Washing Conditions 
Hybridization of radiolabeled nucleic acids to 
immobilized DNA for the purpose of screening cDNA libraries, 
DNA Southern transfers, or northern transfers was routinely 
performed in standard hybridization conditions 
[hybridization: 50% deionized formamide, 200 /ig/ml sonicated 
herring sperm DNA (Cat #223S46, Boehringer Mannheim 
Biochemicals, Indianapolis, IN), 5 x SSPE, 5 x Denhardt ' s , 42° 
C; wash :0.2 x SSPE, 0.1% SDS, 65" C] . The recipes for SSPE 
and Denhardt 's and the preparation of deionized formamide are 
described, for example, in Sambrook et al . (1989) Molecular 
Cloning, A Laboratory Manual, Cold Spring Harbor Laboratory 
Press, Chapter 8) . In some hybridizations, lower stringency 
conditions were used in that 10% deionized formamide replaced 
50% deionized formamide described for the standard 
hybridization conditions. 
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The washing conditions for removing the non-specific 
probe from the filters was either high, medium, or low 
stringency as described below: 

1) high stringency: 0.1 x SSPE, 0.1% SDS, 65°C 

2) medium stringency: 0.2 x SSPE, 0.1% SDS, 50°C 

3) low stringency: 1.0 x SSPE, 0.1% SDS, 50°C. 

It is understood that equivalent stringencies may be achieved 

using alternative buffers, salts and temperatures. 

EXAMPLE Hi ISOLATION OF DMA ENCODING THE HUMAN NEURONAL 
CALCIUM CHANNEL <* 3 SUB UNIT 

A. Isolation of DNA encoding the a 1D subunit 

1. Reference list of partial a lt) cDNA clones 

Numerous a 1D -specif ic cDNA clones were isolated in order 
to characterize the complete a 1D coding sequence plus portions 
of the 5' and 3' untranslated sequences. SEQ ID No. 1 shows 
the complete ot 1D DNA coding sequence, plus 510 nucleotides of 
ct 1D 5' untranslated sequence ending in the guanidine nucleotide 
adjacent to the adenine nucleotide of the proposed initiation 
of translation as well as 642 nucleotides of 3' untranslated 
sequence. Also shown in SEQ ID No. 1 is the deduced amino 
acid sequence. A list of partial cDNA clones used to 

characterize the or 1D sequence and the nucleotide position of 
each clone relative to the full-length a 1D cDNA sequence, which 
is set forth in SEQ ID No. 1, is shown below. The isolation 
and characterization of these clones are described below 
(Example II .A. 2 . ) . 

IMR32 1.144 nt 1 to 510 of SEQ ID No. 1 

5' untranslated sequence, 

nt 511 to 2431, SEQ ID No. 1 

IMR32* 1.136 nt 1627 to 2988, SEQ ID No . 1 

nt 1 to 104 of SEQ ID No. 2 

additional exon, 
IMR32® 1.80 nt 2083 to 6468, SEQ ID No. 1 

IMR32* 1.36 nt 2857 to 4281, SEQ ID No . 1 

IMR32 1.163 nt 5200 to 7635, SEQ ID No. 1 
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* 5' of nt 1627, IMR32 1.136 encodes an intron and an 
additional exon described in Example II.A.2.d. 

@ IMR32 1.80 contains two deletions, nt 2984 to 3131 
and nt 5303 to 5349 (SEQ ID No. 1) . The 148 nt 
deletion (nt 2984 to 3131) was corrected by 
performing a polymerase chain reaction described in 
Example II.A.3.b. 
# IMR32 1.36 contains a 132 nt deletion (nt 3081 to 
3212) . 

2 . Isolation and characterization of individual 
clones listed in Example II. A. 1. 

a. IMR32 1-36 

Two million recombinants of the IMR32 cDNA library #1 
(Example I.B.I.) were screened in duplicate at a density of 
approximately 200,000 plaques per 150 mm plate using a mixture 
of radiolabelled fragments of the coding region of the rabbit 
skeletal muscle calcium channel Oj cDNA [for the sequence of 
the rabbit skeletal muscle calcium channel subunit cDNA, 
see, Tanabe et al. (1987). .Nature 32S:313- 

318] : Fragment Nucleotides 

XpnI-£coRI -78 to 1006 

EcoRI-XhoI 1006 to 2653 

Apal-Apal 3093 to 4182 

Bglll-SacI 4487 to 5310 

The hybridization was performed using low stringency 
hybridization conditions (Example I.e.) and the filters were 
washed under low stringency (Example I.C.). Only one or 1D - 
specific recombinant (IMR32 1.36) of the 2 x 10 £ screened was 
identified. IMR32 1.36 was plaque purified by standard 
methods (J. Sambrook et al . (1989) Molecular Cloning, A 
Laboratory Manual, Cold Spring Harbor Laboratory Press, 
Chapter 8) subcloned into pGEM3 (Promega, Madison, WI) and 
characterized by DNA sequencing. 

b. IMR32 1.80 

Approximately 1 x 10 6 recombinants of the IMR32 cDNA 
library #2 (Example I.B.2.) were screened in duplicate at a 
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density of approximately 100,000 plaques per 150 mm plate 
using the IMR32 1.36 cDNA fragment (Example II. A. 1) as a 
probe. Standard hybridization conditions were used, and the 
filters were washed under high stringency (Example I.e.). 
Three positive plaques were identified one of which was IMR32 
1.80. IMR32 1.80 was plaque purified by standard methods, 
restriction mapped, subcloned, and characterized by DNA 
sequencing . 

c. IMR32 1.144 

Approximately l x 10 6 recombinants of the IMR32 cDNA 
library #3 (Example I.B.3) were screened with the EcoRI-PvuII 
fragment (nt 2083 to 2518, SEQ ID No. 1) of IMR32 1.80. The 
hybridization was performed using standard hybridization 
conditions (Example I.C.) and the filters were washed under 
high stringency (Example I.e.). Three positive plagues were 
identified one of which was IMR32 1.144. IMR32 1.144 was 
plague purified, restriction mapped, and the cDNA insert was 
subcloned into pGEM7Z (Promega, Madison, WI) and characterized 
by DNA sequencing. This characterization revealed that IMR32 
1.144 has a series of ATG codons encoding seven possible 
initiating methionines (nt 511 to 531, SEQ ID No. 1) . Nucleic 
acid amplification analysis, and DNA sequencing of cloned 
nucleic acid amplification analysis products encoding these 
seven ATG codons confirmed that this sequence is present in 
the a 1D transcript expressed in dbcAMP- induced IMR32 cells. 

d. IMR32 1.136 

Approximately 1 x 10 6 recombinants of the IMR32 cDNA 
library #4 (Example I.B.4) were screened with the EcoRI-PvuII 
fragment (nt 2083 to 2518, SEQ ID No. 1) of IMR32 1.80 
(Example II. A. 1.). The hybridization was performed using 
standard hybridization conditions (Example I.C.) and the 
filters were washed under high stringency (Example I.C.) . Six 
positive plaques were identified one of which was IMR32 1.136. 
IMR32 1.136 was plaque purified, restriction mapped, and the 
cDNA insert was subcloned into a standard plasmid vector, 
pSP72 (Promega, Madison, Wi . ) , and characterized by DNA 
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sequencing. This characterization revealed that IMR32 1.136 
encodes an incompletely spliced a 1D transcript. The clone 
contains nucleotides 1627 to 2988 of SEQ ID No. 1 preceded by 
an approximate 640 bp intron. This intron is then preceded by 
a 104 nt exon (SEQ ID No. 2) which is an alternative exon 
encoding the IS6 transmembrane domain [see, e.g., Tanabe et 
al. (1987) .Nature 325:313-318 for a description of the IS1 to 
IVS6 transmembrane terminology] of the of 1D subunit and can 
replace nt 1627 to 173 o, SEQ ID No. 1, to produce a completely 
spliced a 1D transcript. 

e. XMR32 1.163 
Approximately 1 x 10 6 recombinants of the IMR3 2 cDNA 
library #3 (Example I.B.3.) were screened with the Wcol-Xhol 
fragment of IMR32 1.80 (Example II.A.l.) containing nt 5811 to 
6468 (SEQ ID No. 1) . The hybridization was performed using 
standard hybridization conditions (Example I.e.) and the 
filters were washed under high stringency (Example I.C.). 
Three positive plagues were identified one of which was IMR32 
1.163. IMR32 1.163 was plaque purified, restriction mapped, 
and the cDNA insert was subcloned into a standard plasmid 
vector, pSP72 (Promega, Madison, WI . ) , and characterized by 
DNA sequencing. This characterization revealed that IMR32 
1.163 contains the a 1D termination codon, nt 6994 to 6996 (SEQ 
ID No. 1) . 

3. Construction of a full-length a XD cDNA 
tpVDCCIII (A) ] 

Of 1D cDNA clones IMR32 1.144, IMR32 1.136, IMR32 1.80, and 
IMR32 1.163 (Example II. A. 2.) overlap and include the entire 
a lv coding sequence, nt 511 to 6993 (SEQ ID No. 1), with the 
exception of a 148 bp deletion, nt 2984 to 3131 (SEQ ID No. 
1) . Portions of these partial cDNA clones were ligated to 
generate a full-length a 1D cDNA in a eukaryotic expression 
vector. The resulting vector was called pVDCCIII (A) . The 
construction of pVDCCIII(A) was performed in four steps 
described in detail below: (l) the construction of 

pVDCCIII/5' using portions of IMR32 1.144, IMR32 1.136, and 
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IMR32 1.80, (2) the construction of pVDCCIIl/5 ' .3 that 
corrects the 148 nt deletion in the IMR32 1.80 portion of 
pVDCCIII/5', (3) the construction of pVDCCII 1/3 ' . 1 using 
portions of IMR32 1.8 0 and IMR32 1.163, and (4) the ligation 
of a portion of the pVDCCIII/5 ' . 3 insert, the insert of 
pVDCCIIl/3' .1, and pcDNAl (Invitrogen, San Diego, CA) to form 
pVDCCIII(A). The vector pcDNAl is a eukaryotic expression 
vector containing a cytomegalovirus (CMV) promoter which is a 
constitutive promoter recognized by mammalian host cell RNA 
polymerase II. 

Each of the DNA fragments used in preparing the full- 
length construct was purified by electrophoresis through an 
agarose gel onto DE81 filter paper {Whatman, Clifton, NJ) and 
elution from the filter paper using 1.0 M NaCl , 10 mM TRIS, pH 
8.0, l mM EDTA. The ligations typically were performed in a 
10 nl reaction volume with an equal molar ratio of insert 
fragment and a two-fold molar excess of the total insert 
relative to the vector. The amount of DNA used was normally 
about 50 ng to 100 ng. 

a. pVDCCIII/5' 

To construct pVDCCIII/5', IMR32 1.144 (Example II.A.2.C.) 
was digested with Xhol and EcoRl and the fragment containing 
the vector (pGEM7Z) , a 1D nt 1 to 510 (SEQ ID No. l) , and cx 1D nt 
511 to 1732 (SEQ ID No. 1) was isolated by gel 
electrophoresis. The EcoRI-Apal fragment of IMR32 1.136 
(Example II.A.2.d.) nucleotides 1733 to 2671 (SEQ ID No. 1) 
was isolated, and the Apal-Hindlll fragment of IMR32 1.80 
(Example II.A.2.b.), nucleotides 2672 to 4492 (SEQ ID No. 1) 
was isolated. The three DNA clones were ligated to form 
pVDCCIII/5' containing nt 1 to 510 (5' untranslated sequence; 
SEQ ID No. 1) and nt 511 to 4492 (SEQ ID No. 1) . 

b. pVDCCIII/5' .3 

Comparison of the IMR32 1.36 and IMR32 1.80 DNA sequences 
revealed that these two cDNA clones differ through the a 1D 
coding sequence, nucleotides 2984 to 3212. nucleic acid 
amplification analysis of IMR32 1.80 and dbcAMP- induced 
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(1.0 mM, 10 days) IMR32 cytoplasmic RNA (isolated according to 
Ausubel, F.M. et al. (Eds) (1988) Current Protocols in 
Molecular Biology, John Wiley and Sons, New York) revealed 
that IMR32 1.80 had a 148 nt deletion, nt 2984 to 3131 (SEQ ID 
No. 1), and that IMR32 1.36 had a 132 nt deletion, nt 3081 to 
3212. To perform the nucleic acid amplification analysis, the 
amplification reaction was primed with a 1D -specific 
oligonucleotides 112 (nt 2548 to 2572, SEQ ID No. 1) and 311 

(the complementary sequence of nt 3928 to 3 957, SEQ ID No. 1) . 
These products were then reamplified using ct 1D - specific 
oligonucleotides 310 (nt 2583 to 2608 SEQ ID No. 1) and 312 

(the complementary sequence of nt 3883 to 3909) . This 
reamplified product, which contains AccI and Bglll restriction 
sites, was digested with AccI and Bglll and the AccI -Bglll 
fragment, nt 2765 to 3890 (SEQ ID No. 1) was cloned into Accl- 
Bglll digested pVDCCIII/5' to replace the AccI -Bglll 
pVDCClll/5' fragment that had the deletion. This new 
construct was named pVDCCIII/5' .3. DNA sequence determination 
of pVDCCIII/5' .3 through the amplified region confirmed the 
148 nt deletion in IMR32 1.80. 

C. pVDCCIII/3' .1 
To construct pVDCCIII/3 ' . 1 , the cDNA insert of IMR32 
1.163 (Example II.A.2.e.) was subcloned into pBluescript II 

(Stratagene, La Jolla, CA) as an Xhol fragment. The Xhol 
sites on the cDNA fragment were furnished by the adapters used 
to construct the cDNA library (Example I.B.3.). The insert 
was oriented such that the translational orientation of the 
insert of IMR32 1.163 was opposite to that of the lacZ gene 
present in the plasmid, as confirmed by analysis of 
restriction enzyme digests of the resulting plasmid. This was 
done to preclude the possibility of expression of or 1D sequences 
in DH5of cells transformed with this plasmid due to fusion with 
the lacZ gene. This plasmid was then digested with Hindlll 
and Bglll and the Hindlll - Bglll fragment (the Hindlll site 
comes from the vector and the Bglll site is at nt 6220, SEQ ID 
No. 1) was eliminated, thus deleting nt 5200 to 6220 (SEQ ID 
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No. 1) of the IMR32 1.163 clone and removing this sequence 
from the remainder of the plasmid which contained the 3' Bglll 
Xhol fragment, nt 6221 to 7635 (SEQ ID No. 1) . 
pVDCCIII/3' .1 was then made by splicing together the Hindlll- 
PvuII fragment from IMR32 1.80 (nucleotides 4493-5296, SEQ ID 
No. 1), the PvuII - Bglll fragment of IMR32 1.163 (nucleotides 
5294 to 6220, SEQ ID No. 1) and the Hindu I -Bgl II -digested 
pBluescript plasmid containing the 3' Bglll/ Xhol IMR32 1.163 
fragment (nt 6221 to 7635, SEQ ID No. 1) . 

d. pVDCCIII (A) : the full-length o 1D 
construct 

To construct pVDCCIII(A), the Dral-Hindlll fragment (5' 
untranslated sequence nt 330 to 510, SEQ ID No. 1 and coding 
sequence nt 511 to 4492, SEQ ID No. 1) of pVDCCIII/5 ' . 3 
(Example II.A.3.b.) was isolated; the Hindlll-Xhol fragment of 
pVDCCIII/3 ' . 1 (containing nt 4493 to 7635, SEQ ID No. 1, plus 
the Xhol site of the adapter) (Example II.A.3.C.) was 
isolated; and the plasmid vector, pcDNAl, was digested with 
EcoRV and Xhol and isolated on an agarose gel . The three DNA 
fragments were ligated and MC1061-P3 (Invitrogen, San Diego, 
CA) was transformed. Isolated clones were analyzed by 
restriction mapping and DNA sequencing and pVDCCIII (A) was 
identified which had the fragments correctly ligated together: 
Dral-Jfindlll, Hindi 1 1 -Xhol, XhoI-EcoRV with the blunt-end Dral 
and EcoRV site ligating together to form the circular plasmid. 

The amino -terminus of the a 1D subunit is encoded by the 
seven consecutive 5' methionine codons (nt 511 to 531, SEQ ID 
No. 1) . This 5' portion plus nt 532 to 53 7, 

encoding two lysine residues, were deleted from pVDCCIII (A) 
and replaced with an efficient ribosomal binding site (5'- 
ACCACC-3') to form pVDCCIII . RBS (A) . Expression experiments in 
which transcripts of this construct were injected into Xenopus 
laevis oocytes did not result in an enhancement in the 
recombinant voltage-dependent calcium channel expression level 
relative to the level of expression in oocytes injected with 
transcripts of pVDCCIII (A) . 
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B. Isolation of DNA encoding the <v lc subunit 

1. Reference List of Partial a lc cDNA clones 

Numerous a ic - specif ic cDNA clones were isolated in order 
to characterize the a lc coding sequence, the a lc initiation of 
translation, and an alternatively spliced region of a lc . SEQ 
ID No. 3 sets forth one a ic coding sequence (c^d) and deduced 
amino acid sequence; SEQ ID No. 36 sets forth another splice 
variant designated a lc _ 2 . SEQ ID No. 4 and No. 5 encode two 
possible amino terminal ends of an a ic splice variant. SEQ ID 
No. 6 encodes an alternative exon for the IV S3 transmembrane 
domain. Other a lc variants can be constructed by selecting the 
alternative amino terminal ends in place of the ends in SEQ ID 
No. 3 or 36 and/or inserting the alternative exon (SEQ ID No. 
6) in the appropriate location, such as in SEQ ID NO. 3 in 
place of nucleotides 3904-3987. In addition, the 75 
nucleotide sequence (nucleotides 1391-1465 in SEQ ID No. 3) 
can be deleted or inserted to produce an alternative a ic splice 
variant . 

Shown below is a list of clones used to characterize the 
a ic sequence and the nucleotide position of each clone relative 
to the characterized a ic sequence (SEQ ID No. 3) . The 
isolation and characterization of these cDNA clones are 
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below 
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SEQ ID No. 6 

* IMR32 1.86 has a 73 nt deletion compared to the rabbit 
cardiac muscle calcium channel a 1 subunit cDNA sequence. 
e 1.16G is an a lc genomic clone. 

2 . Isolation and characterization of 

clones described in Example II.B.l. 
a. CNS 1.30 

Approximately 1 x 10 6 recombinants of the human thalamus 
cDNA library No. 6 (Example I.B.6.) were screened with 
fragments of the rabbit skeletal muscle calcium channel ot 1 
cDNA described in Example II. A. 2. a. The hybridization was 
performed using standard hybridization conditions (Example 
I.C.) and the filters were washed under low stringency 
(Example I.e.). six positive plagues were identified, one of 
which was CNS 1.30. CNS 1.30 was plague purified, restriction 
mapped, subcloned, and characterized by DNA sequencing. CNS 
1.30 encodes of lc -specif ic sequence nt 2199 to 3903 (SEQ ID No. 
3) followed by nt 1 to 84 of one of two identified alternative 
or lc exons (SEQ ID No. 6). 3' of SEQ ID No. 6, CNS 1.30 
contains an intron and, thus, CNS 1.30 encodes a partially 
spliced a lc transcript. 

b. 1.16G 

Approximately 1 x 10* recombinants of a XEMBL3 -based 
human genomic DNA library (Cat # HL1006d Clontech Corp., Palo 
Alto, CA) were screened using a rabbit skeletal muscle cDNA 
fragment (nt -78 to 1006, Example II. A. 2. a.). The 
hybridization was performed using standard hybridization 
conditions (Example I.C.) and the filters were washed under 
low stringency (Example I.C.) . Fourteen positive plaques were 
identified, one of which was 1.16G. Clone 1.16G was plaque 
purified, restriction mapped, subcloned, and portions were 
characterized by DNA sequencing. DNA sequencing revealed that 
1.16G encodes a ic -specific sequence as described in Example 
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c. IMR32 1.66 and IMR32 1.67 

Approximately 1 x 10 s recombinants of IMR32 cDNA library 
#5 (Example I.B.5.) were screened with a 151 bp KpnI-SacI 
fragment of 1.1 6G (Example II.B.2.b.) encoding or lc sequence (nt 
758 to 867, SEQ ID No . 3) . The hybridization was performed 
using standard hybridization conditions (Example I.e.). The 
filters were then washed in 0.5 x SSPE at 65°C. Of the 
positive plaques, IMR32 1.66 and IMR32 1.67 were identified. 
The hybridizing plaques were purified, restriction mapped, 
subcloned, and characterized by DNA sequencing. Two of these 
cDNA clones, IMR32 1.66 and 1.67, encode a lc subunits as 
described (Example II.B.l.). In addition, IMR32 1.66 encodes 
a partially spliced a ic transcript marked by a GT splice donor 
dinucleotide beginning at the nucleotide 3' of nt 916 (SEQ ID 
No. 3). The intron sequence within 1.66 is 101 nt long. 
IMR32 1.66 encodes the a ic initiation of translation, nt 1 to 
3 (SEQ ID No. 3) and 132 nt of 5' untranslated sequence (SEQ 
ID No. 4) precede the start codon in IMR32 1.66. 

d. IMR32 1.37 and IMR32 1.38 
Approximately 2 x 10 6 recombinants of IMR32 cDNA library 

#1 (Example I.B.I.) were screened with the CNS 1.30 cDNA 
fragment (Example II.B.2.a.) . The hybridization was performed 
using low stringency hybridization conditions (Example I.C.) 
and the filters were washed under low stringency (Example 
I.C). Four positive plaques were identified, plaque 

purified, restriction mapped, subcloned, and characterized by 
DNA sequencing. Two of the clones, IMR32 1.37 and IMR32 1.38 
encode a ic -specif ic sequences as described in Example II.B.l. 

DNA sequence comparison of IMR32 1.37 and IMR32 1.38 
revealed that the a ic transcript includes two exons that encode 
the IVS3 transmembrane domain. IMR32 1.37 has a single exon, 
nt 3904 to 3987 (SEQ ID No. 3) and IMR32 1.38 appears to be 
anomalously spliced to contain both exons juxtaposed, nt 3904 
to 3987 (SEQ ID No. 3) followed by nt 1 to 84 (SEQ ID No. 6) . 
The alternative splice of the a ic transcript to contain either 
of the two exons encoding the IVS3 region was confirmed by 
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comparing the CNS 1.30 sequence to the IMR32 1.37 sequence. 
CNS 1.30 contains nt 1 to 84 (SEQ ID No. 6) preceded by the 
identical sequence contained in IMR32 1.37 for nt 2199 to 3903 
(SEQ ID No. 3) . As described in Example II.B.2.a., an intron 
follows nt 1 to 84 (SEQ ID No. 6) . Two alternative exons have 
been spliced adjacent to nt 3903 (SEQ ID No. 3) represented by 
CNS 1.30 and IMR32 1.37. 

e. IMR32 1.86 
IMR32 cDNA library #1 (Example I.B.I.) was screened in 
duplicate using oligonucleotide probes 90-9 (nt 1462 to 1491, 
SEQ ID No. 3) and 90-12 (nt 2496 to 2520, SEQ ID No. 3) . 
These oligonucleotide probes were chosen in order to isolate 
a clone that encodes the or lc subunit between the 3' end of 
IMR32 1.67 (nt 1717, SEQ ID No. 3) and the 5' end of CNS 1.3 0 
(nt 2199, SEQ ID No. 3) . The hybridization conditions were 
standard hybridization conditions (Example I.C.) with the 
exception that the 50% deionized f ormamide was reduced to 20% . 
The filters were washed under low stringency (Example I.C.) . 
Three positive plaques were identified one of which was IMR32 

I. 86. IMR32 1.86 was plaque purified, subcloned, and 
characterized by restriction mapping and DNA sequencing. 
IMR32 1.86 encodes a ic sequences as described in Example 

II. B.l. Characterization by DNA sequencing revealed that 
IMR32 1.86 contains a 73 nt deletion compared to the DNA 
encoding rabbit cardiac muscle calcium channel a x subunit 
[Mikami et al . (1989) Nature 340:230], nt 2191 to 2263. These 
missing nucleotides correspond to nt 2176-2248 of SEQ ID No. 
3. Because the 5' -end of CNS 1.30 overlaps the 3 '-end of 
IMR32 1.86, some of these missing nucleotides, i.e., nt 2205- 
2248 of SEQ ID No. 3, are accounted for by CNS 1.30. The 
remaining missing nucleotides of the 73 nucleotide deletion in 
IMR32 1.86 (i.e., nt 2176-2204 SEQ ID No . 3) were determined 
by nucleic acid amplification analysis of dbcAMP- induced IMR32 
cell RNA. The 73 nt deletion is a frame-shift mutation and, 
thus, needs to be corrected. The exact human sequence through 
this region, (which has been determined by the DNA sequence of 



WO 95/04822 



PCT/US94/09230 



-70- 

CNS 1.30 and nucleic acid amplification analysis of IMR32 cell 
RNA) can be inserted into IMR32 1.86 by standard methods, 
e.g., replacement of a restriction fragment or site-directed 
mutagenesis . 

f. IMR32 1.157 

One million recombinants of IMR32 cDNA library #4 
(Example I.B.4.) were screened with an XhoI-EcoRI fragment of 
IMR32 1.67 encoding ff ic nt 50 to 774 (SEQ ID No. 3). The 
hybridization was performed using standard hybridization 
conditions (Example I.C.) - The filters were washed under high 
stringency (Example I.C). One of the positive plaques 
identified was IMR32 1.157. This plaque was purified, the 
insert was restriction mapped and subcloned to a standard 
plasmid vector pGEM7Z (Promega, Madison, WI) . The DNA was 
characterized by sequencing. IMR32 1.157 appears to encodes 
an alternative 5' portion of the a ic sequence beginning with 
nt 1 to 89 (SEQ ID No. 5) and followed by nt 1 to 873 (SEQ ID 
No. 3) . Analysis of the 1.66 and 1.157 5' sequence is 
described below (Example II. B. 3.). 

3. Characterization of the e lc initiation of 
translation site 

Portions of the sequences of IMR32 1.157 (nt 57 to 89, 
SEQ ID No. 5; nt 1 to 67, SEQ ID No. 3), IMR32 1.66 (nt 100 to 
132, SEQ ID No. 4; nt 1 to 67, SEQ ID No . 3), were compared to 
the rabbit lung CaCB-receptor cDNA sequence, nt -33 to 67 
[Biel et al . (1990) FEES Lett. 269:409] . The human sequences 
are possible alternative 5' ends of the a lc transcript encoding 
the region of initiation of translation. IMR32 1.66 closely 
matches the CaCB receptor cDNA sequence and diverges from the 
CaCB receptor cDNA sequence in the 5' direction beginning at 
nt 122 (SEQ ID No. 4) . The start codon identified in the CaCB 
receptor cDNA sequence is the same start codon used to 
describe the oi lc coding sequence, nt 1 to 3 (SEQ ID No. 3) . 

The sequences of a ic splice variants, designated a^.^ and 
a lc _ 2 are set forth in SEQ ID NOs . 3 and 36. 
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C. Isolation of partial cDNA clones encoding the or 1B 
subunit and construction of a full-length clone 

A human basal ganglia cDNA library was screened with the 
rabbit skeletal muscle subunit cDNA fragments (see Example 
II. A. 2. a for description of fragments) under low stringency- 
conditions. One of the hybridizing clones was used to screen 
an IMR32 cell cDNA library to obtain additional partial a 1B 
cDNA clones, which were in turn used to further screen an 
IMR32 cell cDNA library for additional partial cDNA clones. 
One of the partial IMR32 Qf 1B clones was used to screen a human 
hippocampus library to obtain a partial a 1B clone encoding the 
3' end of the a 1B coding sequence. The sequence of some of the 
regions of the partial cDNA clones was compared to the 
sequence of products of nucleic acid amplification analysis of 
IMR32 cell RNA to determine the accuracy of the cDNA 
sequences . 

Nucleic acid amplification analysis analysis of IMR32 
cell RNA and genomic DNA using oligonucleotide primers 
corresponding to sequences located 5' and 3' of the STOP codon 
of the DNA encoding the a 1B subunit revealed an alternatively 
spliced ar 1B -encoding mRNA in IMR32 cells. This second mRNA 
product is the result of differential splicing of the a lB 
subunit transcript to include another exon that is not present 
in the mRNA corresponding to the other 3 ' ot 1B cDNA sequence 
that was initially isolated. To distinguish these splice 
variants of the a 1B subunit, the subunit encoded by a DNA 
sequence corresponding to the form containing the additional 
exon is referred to as a^.j (SEQ ID No. 7) , whereas the subunit 
encoded by a DNA sequence corresponding to the form lacking 
the additional exon is referred to as a 1B . 2 (SEQ ID No. 8) . The 
sequence of oi^.! diverges from that of a XB _ 2 beginning at nt 
6633 (SEQ ID No. 7) . Following the sequence of the additional 
exon in ot^^ (nt 6633-6819; SEQ ID No. 7) , the a 1B . a and a 1B _ 2 
sequences are identical (i.e., nt 6820-7362 in SEQ ID No. 7 
and nt 6633-7175 in SEQ ID No. 8) . SEQ ID No. 7 and No. 8 set 
forth 143 nt of 5' untranslated sequence (nt 1-143) as well as 
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202 nt of 3' untranslated sequence (nt 7161-7362, SEQ ID No. 
7) of the DNA encoding and 321 nt of 3' untranslated 

sequence (nt 6855-7175, SEQ ID No. 8) of the DNA encoding a 1B . 2 . 

Nucleic acid amplification analysis analysis of the IS6 
region of the or 1B transcript revealed what appear to be 
additional splice variants based on multiple fragment sizes 
seen on an ethidium bromide-stained agarose gel containing the 
products of the amplification reaction. 

A full-length a 1B .! cDNA clone designated pcDNA-a 1B .! was 
prepared in an eight-step process as follows. 

STEP 1: The Sad restriction site of pGEM3 (Promega, 
Madison, WI) was destroyed by digestion at the SacI 
site, producing blunt ends by treatment with T4 DNA 
polymerase, and religation. The new vector was 
designated pGEMASac. 

STEP 2: Fragment 1 (Hindlll/Kpnl ; nt 2337 to 4303 of SEQ ID 
No. 7) was ligated into Hindlll/Kpnl digested 
pGEM3 ASac to produce pal.l77HK. 

STEP 3: Fragment 1 has a 2 nucleotide deletion (nt 3852 and 
3 853 of SEQ ID No . 7) . The deletion was repaired 
by inserting an amplfied fragment (fragment 2) of 
IMR32 RNA into pal.l77HK. Thus, fragment 2 
(Narl/Kpnl; nt 3828 to 4303 of SEQ ID No. 7) was 
inserted into Narl/Kpnl digested porl.l77HK 
replacing the Narl/Kpnl portion of fragment 1 and 
producing pal . 177HK/PCR . 

STEP 4: Fragment 3 {KpnI/Kpnl ; nt 4303 to 5663 of SEQ ID 
No. 7) was ligated into Kpnl digested pari . 177HK/PCR 
to produce palB5'K. 

STEP 5: Fragment 4 ( EcoRI / Hindi II; £coRI adaptor plus nt 1 
to 2337 of SEQ ID No. 7) and fragment 5 
(Hindlll/Xhol fragment of palBS'K; nt 2337 to 5446 
of SEQ ID No. 7) were ligated together into 
£coRI/XhoI digested pcDNAl (Invitrogen, San Diego, 
CA) to produce porlB5 ' . 
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STEP 6: Fragment 6 (EcoRl/EcoRI ; EcoRI adapters on both 
ends plus nt 5749 to 7362 of SEQ ID No . 7) was 
ligated into .EcoRI digested pBluescript II KS 
(Stratagene, La Jolla, CA) with the 5' end of the 
fragment proximal to the Kpnl site in the 
polylinker to produce pal. 230. 

STEP 7: Fragment 7 (Kpnl/Xhol; nt 4303 to 5446 of SEQ ID 
No. 7), and fragment 8 (Xhol/Cspl; nt 5446 to 6259 
of SEQ ID No. 7) were ligated into Kpnl/Cspl 
digested pal. 230 (removes nt 5749 to 6259 of SEQ ID 
No. 7 that was encoded in pal. 23 0 and maintains nt 
6259 to 7362 of SEQ ID No. 7) to produce palB3 ' . 

STEP 8: Fragment 9 (Sphl/Xhol; nt 4993 to 5446 of SEQ ID 
No. 7) and fragment 10 (Xhol/Xbal of palB3 ' ; nt 
5446 to 7319 of SEQ ID No. 7) were ligated into 
Sphl/Xbal digested palB5' (removes nt 4993 to 5446 
of SEQ ID No. 7 that were encoded in palB5' and 
maintains nt 1 to 4850 of SEQ ID No. 7) to produce 
pcDNAa 1B _! . 

The resulting construct, pcDNAa 1B . 1( contains, in pCDNAl , 
a full-length coding region encoding a 1B _i (nt 144-7362, SEQ ID 
No. 7), plus 5' untranslated sequence (nt 1-143, SEQ ID No. 7) 
and 3' untranslated sequence (nt 7161-7319, SEQ ID No. 7) 
under the transcriptional control of the CMV promoter. 

D. isolation of DNA encoding human calcium channel a 1A 
subunits 

1. Isolation of partial clones 

DNA clones encoding portions of human calcium channel a lfi 
subunits were obtained by hybridization screening of human 
cerebellum cDNA libraries and nucleic acid amplification of 
human cerebellum RNA. Clones corresponding to the 3' end of 
the a 1A coding sequence were isolated by screening 1 x 10 £ 
recombinants of a randomly primed cerebellum cDNA library 

(size-selected for inserts greater than 2.8 kb in length) 
under low stringency conditions (6X SSPE, 5X Denhart's 

solution, 0.2% SDS, 200 pg/ml sonicated herring sperm DNA, 
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42 °C) with oligonucleotide 704 containing nt 6190-6217 of the 
rat a 1A coding sequence [Starr et al . (1992) Proc . Natl. Acad. 
Sci . U.S.A. S8: 5621-5625] . Washes were performed under low 
stringency conditions. Several clones that hybridized to the 
probe {clones al . 251-al .259 and al.244) were purified and 
characterized by restriction enzyme mapping and DNA sequence 
analysis. At least two of the clones, al.244 and al.254, 
contained a translation termination codon. Although clones 
al.244 and al.254 are different lengths, they both contain a 
sequence of nucleotides that corresponds to the extreme 3' end 
of the a 1A transcript, i.e. , the two clones overlap. These two 
clones are identical in the region of overlap, except, clone 
al.244 contains a sequence of 5 and a sequence of 12 
nucleotides that are not present in al.254. 

To obtain additional a 1A -encoding clones, 1 x 10 6 
recombinants of a randomly primed cerebellum cDNA library 
(size-selected for inserts ranging from 1.0 to 2.8 kb in 
length) was screened for hybridization to three 
oligonucleotides: oligonucleotide 701 (containing nucleotides 
2288-2315 of the rat b u coding sequence) , oligonucleotide 702 
(containing nucleotides 3559-3585 of the rat a 1A coding 
sequence) and oligonucleotide 703 (containing nucleotides 
4798-4827 of the rat a 1A coding sequence) . Hybridization and 
washes were performed using the same conditions as used for 
the first screening with oligonucleotide 704, except that 
washes were conducted at 45°C. Twenty clones (clones al.269- 
al.288) hybridized to the probe. Several clones were plaque- 
purified and characterized by restriction enzyme mapping and 
DNA sequence analysis. One clone, al.2 79, contained a 
sequence of about 170 nucleotides that is not present in other 
clones corresponding to the same region of the coding 
sequence. This region may be present in other splice 
variants. None of the clones contained a translation 
intiation codon. 

To obtain clones corresponding to the 5' end of the human 
a 1A coding sequence, another cerebellum cDNA library was 
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prepared using oligonucleotide 720 (containing nucleotides 
2485-2510 of SEQ ID No. 22) to specifically prime first-strand 
cDNA synthesis. The library (8 x 10 5 recombinants) was 

screened for hybridization to three oligonucleotides: 
oligonucleotide 701, oligonucleotide 726 (containing 
nucleotides 2333-2360 of the rat a lfl coding sequence) and 
oligonucleotide 700 (containing nucleotides 767-796 of the rat 
a 1A coding sequence) under low stringency hybridization and 
washing conditions. Approximately 50 plaques hybridized to 
the probe. Hybridizing clones al . 38l-al . 390 were plaque- 
purified and characterized by restriction enzyme maping and 
DNA sequence analysis. At least one of the clones, al.381, 
contained a translation initiation codon. 

Alignment of the sequences of the purified clones 
revealed that the sequences overlapped to comprise the entire 
a 1A coding sequence. However, not all the overlapping 
sequences of partial clones contained convenient enzyme 
restriction sites for use in ligating partial clones to 
construct a full-length a„ coding sequence. To obtain DNA 
fragments containing convenient restriction enzyme sites that 
could be used in constructing a full-length a 1A DNA, cDNA was 
synthesized from RNA isolated from human cerebellum tissue and 
subjected to nucleic acid amplification. The oligonucleotides 
used as primers corresponded to human or 1A coding sequence 
located 5' and 3' of selected restriction enzyme sites. Thus, 
in the first amplification reaction, oligonucleotides 753 
(containing nucleotides 2368-2391 of SEQ ID No. 22) and 72 8 
(containing nucleotides 3179-3202 of SEQ ID No. 22) were used 
as the primer pair. To provide a sufficient amount of the 
desired DNA fragment, the product of this amplification was 
reamplified using oligonucleotides 753 and 754 (containing 
nucleotides 3112-3135 of SEQ ID No. 22 as the primer pair. 
The resulting product was 768 bp in length. In the second 
amplification reaction, oligonucleotides 719 (containing 
nucleotides 4950-4975 of SEQ ID No. 22 and 752 (containing 
nucleotides 5647-5670 of SEQ ID No. 22) were used as the 
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primer pair. To provide a sufficient amount of the desired 
second DNA fragment, the product of this amplification was 
reamplif ied using oligonucleotides 756 (containing nucleotides 
5112-5135 of SEQ ID No. 22) and 752 as the primer pair. The 
resulting product was 559 bp in length. 

2. Construction of full-Length a 1A coding sequences 

Portions of clone al.381, the 768-bp nucleic acid 
amplification product, clone al.278, the 559-bp nucleic acid 
amplification product, and clone al.244 were ligated at 
convenient restriction sites to generate a full-length a 1A 
coding sequence referred to as a 1A .j. 

Comparison of the results of sequence analysis of clones 
al.244 and al.254 indicated that the primary transcript of the 
a 1A subunit gene is alternatively spliced to yield at least two 
variant mRNAs encoding different forms of the a 1A subunit . One 
form, <v 1A -i/ is encoded by the sequence shown in SEQ ID No. 22. 
The sequence encoding a second form, a 1A _ 2 , differs from the a 1A _ 
i-encoding sequence at the 3' end in that it lacks a 5-nt 
sequence found in clone al.244 (nucleotides 7035-7039 of SEQ 
ID No. 22) . This deletion shifts the reading frame and 
introduces a translation termination codon resulting in an a 1A _ 2 
coding sequence that encodes a shorter a 1A subunit than that 
encoded by the a 1A _.! splice variant. Consequently, a portion 
of the 3' end of the a lA .j coding sequence is actually 3' 
untranslated sequence in the a 1A _ 2 DNA. The complete sequence 
°f a iA-2/ which can be constructed by ligating portions of clone 
al.381, the 768-bp nucleic acid amplification product, clone 
al.278, the 559-bp nucleic acid amplification product and 
clone al.254, is set forth in SEQ ID No. 23. 

E. Isolation of DNA Encoding the a 1E Subunit 

DNA encoding a 1E subunits of the human calcium channel 
were isolated from human hippocampus libraries. The selected 
clones sequenced. DNA sequence analysis of DNA clones 
encoding the a 1E subunit indicated that at least two 
alternatively spliced forms of the same a 1E subunit primary 
transcript are expressed. One form has the sequence set forth 
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in SEQ ID No. 24 and was designated and the other was 

designated or 1E . 3 , which has the sequence obtained by inserting 
a 57 base pair fragment between nucleotides 2405 and 2406 of 
SEQ ID No. 24. The resulting sequence is set forth in SEQ ID 
No. 25. 

The subunit designated a 1E . 2 has a calculated molecular 
weight of 254,836 and the subunit designated a 1E _ 3 has a 
calculated molecular weight of 257,34 8. a 1E . 3 has a 19 amino 
acid insertion (encoded by SEQ ID No. 25) relative to ot^ in 
the region that appears to be the cytoplasmic loop between 
transmembrane domains IIS6 and IIIS1. 

EXAMPLE III: ISOLATION OF cDNA CLONES ENCODING THE HUMAN 

NEURONAL CALCIUM CHANNEL ^ subunit 
A. Isolation of partial cDNA clones encoding the /5 
subunit and construction of a full-length clone 
encoding the ^ subunit 

A human hippocampus cDNA library was screened with the 
rabbit skeletal muscle calcium channel subunit cDNA 

fragment (nt 441 to 1379) [for isolation and sequence of the 
rabbit skeletal muscle calcium channel /3j subunit cDNA, see 
U.S. Patent Application Serial NO. 482,384 or Ruth et al . 
(1989) Science 245:1115] using standard hybridization 
conditions (Example I.e.). A portion of one of the 
hybridizing clones was used to rescreen the hippocampus 
library to obtain additional cDNA clones. The cDNA inserts of 
hybridizing clones were characterized by restriction mapping 
and DNA sequencing and compared to the rabbit skeletal muscle 
calcium channel ^ 1 subunit cDNA sequence. 

Portions of the partial /3 1 subunit cDNA clones were 
ligated to generate a full-length clone encoding the entire /Sj 
subunit. SEQ ID No. 9 shows the jSj subunit coding sequence 
(nt 1-1434) as well as a portion of the 3' untranslated 
sequence (nt 1435-1546) . The deduced amino acid sequence is 
also provided in SEQ ID No. 9. In order to perform expression 
experiments, full-length ^ subunit cDNA clones were 
constructed as follows . 
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Step l: DNA fragment 1 (-800 bp of 5' untranslated 
sequence plus nt 1-277 of SEQ ID No. 9) was ligated to DNA 
fragment 2 (nt 277-1546 of SEQ ID No. 9 plus 44 8 bp of intron 
sequence) and cloned into pGEM7Z. The resulting plasmid, p/3l- 
1.18, contained a full-length /Jj subunit clone that included 
a 44 8 -bp intron. 

Step 2: To replace the 5' untranslated sequence of p/3l- 
1.18 with a ribosome binding site, a double -stranded adapter 
was synthesized that contains an EcoRI site, sequence encoding 
a ribosome binding site (5' -ACCACC-3 ' ) and nt 1-25 of SEQ ID 
No. 9. The adapter was ligated to Sinai -digested p/Sl-1.18, and 
the products of the ligation reaction were digested with 
EcoRI . 

Step 3 : The EcoRI fragment from step 2 containing the 
EcoRI adapter, efficient ribosome binding site and nt 1-1546 
of SEQ ID No. 9 plus intron sequence was cloned into a plasmid 
vector and designated p/81-1 . 18RBS . The EcoRI fragment of p/3l- 
1.18RBS was subcloned into EcoRI- digested pcDNAl with the 
initiation codon proximal to CMV promoter to form 
pHBCaCH/3 la RBS (A) . 

Step 4: To generate a full-length clone encoding the /S a 
subunit lacking intron sequence, DNA fragment 3 (nt 6 9-1146 of 
SEQ ID No. 9 plus 448 bp of intron sequence followed by nt 
1147-1546 of SEQ ID No. 9), was subjected to site-directed 
mutagenesis to delete the intron sequence, thereby yielding 
pjSl(-). The EcoRI-XhoI fragment of p/3l-1.18RBS (containing of 
the ribosome binding site and nt 1-277 of SEQ ID No. 9) was 
ligated to the XhoI-EcoRI fragment of p/3l(-) (containing of nt 
277-1546 of SEQ ID No. 9) and cloned into pcDNAl with the 
initiation of translation proximal to the CMV promoter. The 
resulting expression plasmid was designated pHBCaCH/? lb RBS (A) . 

B. Splice Variant f} x _ 3 

DNA sequence analysis of the DNA clones encoding the /3 1 
subunit indicated that in the CNS at least two alternatively 
spliced forms of the same human /8 X subunit primary transcript 
are expressed. One form is represented by the sequence shown 
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in SEQ ID No. 9 and is referred to as p 1 . 2 . The sequences of 
and the alternative form, /^.j, diverge at nt 1334 (SEQ 
ID No. 9) . The complete /3 10 sequence (nt 1-1851) , including 
3' untranslated sequence (nt 1795-1851), is set forth in SEQ 
ID No. 10. 

EXAMPLE XV: ISOLATION OP cDNA CLONES ENCODING THE HUMAN 

NEURONAL CALCIUM CHANNEL a 2 - subunit 
A. Isolation of cDNA clones 

The complete human neuronal at 2 coding sequence (nt 35- 
3310) plus a portion of the 5' untranslated sequence (nt 1 to 
34) as well as a portion of the 3' untranslated sequence (nt 
3311-3600) is set forth in SEQ ID No. 11. 

To isolate DNA encoding the human neuronal a 2 subunit, 
human a 2 genomic clones first were isolated by probing human 
genomic Southern blots using a rabbit skeletal muscle calcium 
channel a 2 subunit cDNA fragment [nt 43 to 272, Ellis et al . 
(1988) Science 240:1661] . Human genomic DNA was digested with 
EcoRl, electrophoresed, blotted, and probed with the rabbit 
skeletal muscle probe using standard hybridization conditions 
(Example I.C.) and low stringency washing conditions (Example 
I.C.). Two restriction fragments were identified, 3.5 kb and 
3.0 kb. These BcoRI restriction fragments were cloned by 
preparing a Xgtll library containing human genomic EcoRI 
fragments ranging from 2.2 kb to 4.3 kb. The library was 
screened as described above using the rabbit at 2 probe, 
hybridizing clones were isolated and characterized by DNA 
sequencing. HGCaCHa2.20 contained the 3.5 kb fragment and 
HGCaCHa2 . 9 contained the 3 . 0 kb fragment . 

Restriction mapping and DNA sequencing revealed that 
HGCaCHa2.20 contains an 82 bp exon (nt 130 to 211 of the human 
a 2 coding sequence, SEQ ID No. 11) on a 650 bp Pstl-Xbal 
restriction fragment and that HGCaCHa2 . 9 contains 105 bp of an 
exon (nt 212 to 316 of the coding sequence, SEQ ID No. 11) on 
a 750 bp Xba.I-Bgll~L restriction fragment. These restriction 
fragments were used to screen the human basal ganglia cDNA 
library (Example II.C.2.a.). HBCaCHor2 . 1 was isolated (nt 29 
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to 1163, SEQ ID No. 11) and used to screen a human brain stem 
cDNA library (ATCC Accession No. 37432) obtained from the 
American Type Culture Collection, 12301 Parklawn Drive, 
Rockville, MD. 20852. Two clones were isolated, HBCaCHa2 . 5 
(nt 1 to 1162, SEQ ID No. 11) and HBCaCHa2 . 8 (nt 714 to 1562, 
SEQ ID No. 11, followed by 1600 nt of intervening sequence) . 
A 2400 bp fragment of HBCaCHof2 . 8 (beginning at nt 759 of SEQ 
ID No. 11 and ending at a Smal site in the intron) was used to 
rescreen the brain stem library and to isolate HBCaCHor2.il (nt 
879 to 3600, SEQ ID No. 11). Clones HBCaCHof2 . 5 and 
HBCaCHo:2.11 overlap to encode an entire human brain a 2 
protein. 

B. Construction of pHB CaCHor 2 A 

To construct pHBCaCHo-jA containing DNA encoding a full- 
length human calcium channel a 2 subunit, an (EcoRI) -Pvull 
fragment of HBCaCHor2.5 (nt 1 to 1061, SEQ ID No. 11, EcoRI 
adapter, Pvull partial digest) and a PvuII-PstI fragment of 
HBCaCHQf2.11 (nt 1061 to 2424 SEQ ID No. 11; Pvull partial 
digest) were ligated into EcoRI-Pstl- digested pIBI24 
(Stratagene, La Jolla, CA) . Subsequently, an (EcoRI) -PstI 
fragment (nt 1 to 2424 SEQ ID No. 11) was isolated and ligated 
to a PstI- (EcoRI) fragment (nt 2424 to 3600 SEQ ID No. 11) of 
HBCaCHa2.ll in £coRI -digested pIBI24 to produce DNA, HBCaCHa2 , 
encoding a full-length human brain o- 2 subunit. The 3600 bp 
EcoRI insert of HBCaCHor2 (nt 1 to 3600, SEQ ID No. 11) was 
subcloned into pcDNAl (pHBCaCHo(2A) with the methionine 
initiating codon proximal to the CMV promoter. The 3600 bp 
EcoRI insert of HBCaCHa2 was also subcloned into pSV2dHFR 
[Subramani et aJ . (1981). Afol . Cell. Biol. 1:854-864] which 
contains the SV4 0 early promoter, mouse dihydrof olate 
reductase (dhfr) gene, SV40 polyadenylation and splice sites 
and sequences required for maintenance of the vector in 
bacteria. 
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EXAMPLE V. DIFFERENTIAL PROCESSING OF THE HUMAN 6 

TRANSCRIPT AND THE HUMAN or 2 TRANSCRIPT 
A. Differential processing of the B^ transcript 

Nucleic acid amplification analysis of the human /3 2 
transcript present in skeletal muscle, aorta, hippocampus and 
basal ganglia, and HEK 293 cells revealed differential 
processing of the region corresponding to nt 615-781 of SEQ ID 
No. 9 in each of the tissues. Four different sequences that 
result in five different processed B t transcripts through this 
region were identified. The B 1 transcripts from the different 
tissues contained different combinations of the four 
sequences, except for one of the B 1 transcripts expressed in 
HEK 293 cells (0 X . 5 ) which lacked all four sequences. 

None of the B x transcripts contained each of the four 

sequences; however, for ease of reference, all four sequences 

are set forth end-to-end as a single long sequence in SEQ ID 

No. 12. The four sequences that are differentially processed 

are sequence 1 (nt 14-34 in SEQ ID No. 12), sequence 2 (nt 35- 

55 in SEQ ID No. 12), sequence 3 (nt 56-190 in SEQ ID No. 12) 

and sequence 4 (nt 191-271 in SEQ ID No. 12). The forms of 

the (3 1 transcript that have been identified include: (l) a 

form that lacks sequence 1 called B^ (expressed in skeletal 

muscle) , (2) a form that lacks sequences 2 and 3 called P 1 . 2 

(expressed in CNS) , (3) a form that lacks sequences 1, 2 and 3 

called B 1 . 4 (expressed in aorta and HEK cells) and (4) a form 

that lacks sequences 1-4 called B^ (expressed in HEK cells) . 

Additionally, the B lm , and B x . 5 contain a guanine nucleotide (nt 

13 in SEQ ID No. 12) that is absent in the B 1 . 1 and B x _ 2 forms. 

The sequences of B 1 splice variants are set forth in SEQ ID 

Nos. 9, 10 and 33-35. 

B. Differential processing of transcripts encoding the 
a 2 subunit. 

The complete human neuronal a 2 coding sequence (nt 3 5- 
33 07) plus a portion of the 5' untranslated sequence (nt l to 
34) as well as a portion of the 3' untranslated sequence (nt 
3308-3600) is set forth as SEQ ID No. 11. 
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Nucleic acid amplification analysis of the human ct 2 
transcript present in skeletal muscle, aorta, and CNS revealed 
differential processing of the region corresponding to nt 
1595-1942 of SEQ ID No. 11 in each of the tissues. 

The analysis indicated that the primary transcript 
of the genomic DNA that includes the nucleotides corresponding 
to nt 1595-1942 also includes an additional sequence (SEQ ID 
No. 13: 5 ' C CT ATTG G T G T AG G T AT A C C AA C AAT T AAT TT 
AAGAAAAAGGAGACCCAATATCCAG 3') inserted between nt 1624 and 
1625 of SEQ ID No. li. Five alternatively spliced variant 

transcripts that differ in the presence or absence of one to 
three different portions of the region of the primary 
transcript that includes the region of nt 1595-1942 of SEQ ID 
No. 11 plus SEQ ID No. 13 inserted between nt 1624 and 1625 
have been identified. The five or 2 -encoding transcripts from 
the different tissues include different combinations of the 
three sequences, except for one of the a 2 transcripts 
expressed in aorta which lacks all three sequences . None of 
the of 2 transcripts contained each of the three sequences. The 
sequences of the three regions that are differentially 
processed are sequence 1 (SEQ ID No. 13) , sequence 2 (5' 
AACCCCAAATCTCAG 3', which is nt 1625-1639 of SEQ ID No. 11), 
and sequence 3 (5' CAAAAAAGGGCAAAATGAAGG 3', which is nt 
1908-1928 of SEQ ID No. 11) . The five a 2 forms identified are 
(1) a form that lacks sequence 3 called a 2a (expressed in 
skeletal muscle) , (2) a form that lacks sequence 1 called a 2b 
(expressed in CNS) , (3) a form that lacks sequences 1 and 2 
called a 2c expressed in aorta) , (4) a form that lacks 
sequences 1, 2 and 3 called a 2d (expressed in aorta) and (5) 
a form that lacks sequences 1 and 3 called ot 2e (expressed in 
aorta) . 

The sequences of or 2a - a 2c are set forth in SEQ. ID Nos . 
29 - 32, respectively. 
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EXAMPLE VI: ISOLATION OF DNA ENCODING A CALCIUM CHANNEL y 

SUB UNIT FROM A HUMAN BRAIN cDNA LIBRARY 

A. Isolation of DNA encoding the 7 subunit 

Approximately l x 10 s recombinants from a Xgtll -based 
human hippocampus cDNA library (Clontech catalog #HL1088b, 
Palo Alto, CA) were screened by hybridization to a 4 84 bp 
sequence of the rabbit skeletal muscle calcium channel y 
subunit cDNA (nucleotides 621-626 of the coding sequence plus 
438 nucleotides of 3 ' -untranslated sequence) contained in 
vector 7JIO [Jay, S. et al . (1990). Science 248:490-492] . 
Hybridization was performed using moderate stringency 
conditions (20% deionized formamide, 5x Denhardt's, 6 x SSPE, 
0.2% SDS, 20 /zg/ml herring sperm DNA, 42°C) and the filters 
were washed under low stringency (see Example I.e.) . A plaque 
that hybridized to this probe was purified and insert DNA was 
subcloned into pGEM7Z. This cDNA insert was designated yl. 4. 

B. Characterization of yl.4 

•yl.4 was confirmed by DNA hybridization and characterized 
by DNA sequencing. The 1500 bp SstI fragment of yl.4 
hybridized to the rabbit skeletal muscle calcium channel y 
subunit cDNA yJlO on a Southern blot. SEQ analysis of this 
fragment revealed that it contains of approximately 500 nt of 
human DNA sequence and -1000 nt of Xgtll sequence (included 
due to apparent destruction of one of the EcoRI cloning sites 
in Xgtll) . The human DNA sequence contains of 129 nt of 
coding sequence followed immediately by a translational STOP 
codon and 3' untranslated sequence (SEQ ID No. 14) . 

To isolate the remaining 5' sequence of the human y 
subunit cDNA, human CNS cDNA libraries and/or preparations of 
mRNA from human CNS tissues can first be assayed by nucleic 
acid amplification analysis methods using oligonucleotide 
primers based on the 7 cDNA-specif ic sequence of yl.4. 
Additional human neuronal y subunit -encoding DNA can be 
isolated from cDNA libraries that, based on the results of the 
nucleic acid amplification analysis assay, contain y-specific 
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amplifiable cDNA. Alternatively, cDNA libraries can be 
constructed from mRNA preparations that, based on the results 
of the nucleic acid amplification analysis assays, contain y- 
specific amplifiable transcripts. Such libraries are 

constructed by standard methods using oligo dT to prime first- 
strand cDNA synthesis from poly A* RNA (see Example I.B.). 
Alternatively, first-strand cDNA can be specified by priming 
first-strand cDNA synthesis with a y cDNA-specif ic 
oligonucleotide based on the human DNA sequence in -yl.4. A 
cDNA library can then be constructed based on this first - 
strand synthesis and screened with the 7-specific portion of 
71 .4 . 

EXAMPLE VII: ISOLATION OF cDNA CLONES ENCODING THE HUMAN 

NEURONAL Ca CHANNEL 0 2 SUB UNIT 

Isolation of DNA Encoding human calcium channel 0 2 
sub units 

Sequencing of clones isolated as described in Example III 
revealed a clone encoding a human neuronal calcium channel 0 2 
subunit (designated 0 2D see, SEQ ID No. 26) . An 
oligonucleotide based on the 5' end of this clone was used to 
prime a human hippocampus cDNA library. The library was 
screened with this 0 2 clone under conditions of low to medium 
stringency (final wash 0.5 X SSPE, 50° C) . Several 
hybridizing clones were isolated and sequenced. Among these 
clones were those that encode 0 2C , 0 2D and 0 2E . For example, 
the sequence of 0 2C is set forth in SEQ ID NO. 37, and the 
sequeence of 0 2E is set forth in SEQ ID No. 38. 

A randomly primed hippocampus library was then screened 
using a combination of the clone encoding 0 2D and a portion of 
the 0 3 clone deposited under ATCC Accession No. 69048. 
Multiple hybridizing clones were isolated. Among these were 
clones designated 0101, 0102 and 0104. 0101 appears to 
encodes the 5' end of a splice variant of 0 2 , designated 0 2E . 
0102 and 0104 encode portions of the 3' end of 0 2 . 

It appears that the 0 2 splice variants include 
nucleotides 182-2294 of SEQ ID No. 26 and differ only between 
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the start codon and nucleotides that correspond to 212 of SEQ. 
ID No. 26. 

EXAMPLE VIII: ISOLATION OF cDNA CLONES ENCODING HUMAN 
CALCIUM CHANNEL /S 4 and /S 3 SUBUNITS 

A. Isolation of eDNA Clones Encoding a Human 0 t Subunit 

A clone containing a translation initiation codon and 
approximately 60% of the 0 4 coding sequence was obtained from 
a human cerebellum cDNA library (see nucleotides 1-894 of 
Sequence ID No. 27) . To obtain DNA encoding the remaining 3' 
portion of the /3« coding sequence, a human cerebellum cDNA 
library was screened for hybridization a nucleic acid 
amplification product under high stringency hybridization and 
wash conditions. Hybridizing clones are purified and 
characterized by restriction enzyme mapping and DNA sequence 
analysis to identify those that contain sequence corresponding 
to the 3 ' end of the j8« subunit coding sequence and a 
termination codon. Selected clones are ligated to the clone 
containing the 5' half of the 0 4 coding sequence at convenient 
restriction sites to generate a full-length cDNA encoding a /3 4 
subunit. The sequence of a full-length /8 4 clone is set forth 
in SEQ ID No. 27; the amino acid sequence is set forth in SEQ 
ID No. 28. 

B. Isolation of cDNA Clones Encoding a Human /S3 
Subunit 

Sequencing of clones isolated as described in Example III 
also revealed a clone encoding a human neuronal calcium 
channel (3 3 subunit. This clone has been deposited as plasmid 
01.42 (ATCC Accession No. 69048) . 

To isolate a full-length cDNA clone encoding a complete 
/3 3 subunit, a human hippocampus cDNA library (Stratagene, La 
Jolla, CA) was screened for hybridization to a 5' EcoRI-PstI 
fragment of the cDNA encoding (S^.^ using lower stringency 
hybridization conditions (20% deionized formamide, 200 (ig/ml 
sonicated herring sperm DNA, 5X SSPE, 5X Denhardt ' s solution, 
42° C) and wash conditions. One of the hybridizing clones 
contained both translation initiation and termination codons 
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and encodes a complete /3 3 subunit designated (Sequence ID 

No. 19) . In vitro transcripts of the cDNA were prepared and 
injected into Xenopus oocytes along with transcripts of the 
a 1B . x and ar 2b cDNAs using methods similiar to those described in 
Example IX. D. Two-electrode voltage clamp recordings of the 
oocytes revealed significant voltage -dependent inward Ba 2 * 
currents . 

An additional 0 3 subunit -encoding clone, designated /3 3 _ 2 , 
was obtained by screening a human cerebellum cDNA library for 
hybridization to the nucleic acid amplification product 
referred to in Example VIII .A. under lower stringency (20% 
deionized formamide, 200 ng/ml sonicated herring sperm DNA, 5X 
SSPE, 5X Denhardt's solution, 42° C) hybridization and wash 
conditions. The 5' ends of this clone (Sequence ID No. 20, 0 3 . 
2 ) and the first /3 3 subunit, designated p 3 . lt (Sequence ID No. 
19) differ at their 5' ends and are splice variants of the /S 3 
gene . 

EXAMPLE IX: RECOMBINANT EXPRESSION OF HUMAN NEURONAL 

CALCIUM CHANNEL SUBUNIT- ENCODING cDNA AND RNA 
TRANSCRIPTS IN MAMMALIAN CELLS 

A. Recombinant Expression of the Human 

Neuronal Calcium Channel or 2 subunit cDNA 
in DG44 Cells 

1. Stable transfection of DG44 cells 

DG44 cells [dhfr - Chinese hamster ovary cells; see, e.g., 
Urlaub, G. et al . (1986) Som. Cell Molec. Genet. 12:555-566] 
obtained from Lawrence Chasin at Columbia University were 
stably transfected by CaPO« precipitation methods [Wigler et 
al. (1979) Proc. Natl. Acad. Sci . USA 76:1373-1376] with 
pSV2dhfr vector containing the human neuronal calcium channel 
cxj-subunit cDNA (see Example IV) for polycistronic 
expression/selection in transfected cells. Transf ectants were 
grown on 10% DMEM medium without hypoxanthine or thymidine in 
order to select cells that had incorporated the expression 
vector. Twelve transf ectant cell lines were established as 
indicated by their ability to survive on this medium. 



WO 95/04822 



PCT/US94/09230 



-87- 

2. Analysis of ct 2 subunit cDNA expression in 
transfected DG44 cells 

Total RNA was extracted according to the method of 
Birnboim [(1988) Nuc. Acids Res. 15:1487-1497] from four of 
the DG44 cell lines that had been stably transfected with 
pSV2dhfr containing the human neuronal calcium channel ot 2 
subunit cDNA. RNA (-15 fig per lane) was separated on a 1% 
agarose formaldehyde gel, transferred to nitrocellulose and 
hybridized to the random-primed human neuronal calcium channel 
a 2 cDNA (hybridization: 50% formamide, 5 x SSPE, 5 x 
Denhardt's, 42° C. ; wash :0.2 x SSPE, 0.1% SDS, 65° C). 
Northern blot analysis of total RNA from four of the DG44 cell 
lines that had been stably transfected with pSV2dhfr 
containing the human neuronal calcium channel a 2 subunit cDNA 
revealed that one of the four cell lines contained hybridizing 
mRNA the size expected for the transcript of the a 2 subunit 
cDNA (5000 nt based on the size of the cDNA) when grown in the 
presence of 10 mM sodium butyrate for two days. Butyrate 
nonspecifically induces transcription and is often used for 
inducing the SV4 0 early promoter [Gorman, C. and Howard, B. 
(1983) Nucleic Acids Res. 22:1631]. This cell line, 44a 2 -9, 
also produced mRNA species smaller (several species) and 
larger (6800 nt) than the size expected for the transcript of 
the ot 3 cDNA (5000 nt) that hybridized to the a 2 . cDNA-based 
probe. The 5000- and 6800-nt transcripts produced by this 
transfectant should contain the entire a 2 subunit coding 
sequence and therefore should yield a full-length a 2 subunit 
protein. A weakly hybridizing 8000-nucleotide transcript was 
present in untransf ected and transfected DG44 cells. 
Apparently, DG44 cells transcribe a calcium channel a 2 subunit 
or similar gene at low levels. The level of expression of 
this endogenous ct 2 subunit transcript did not appear to be 
affected by exposing the cells to butyrate before isolation of 
RNA for northern analysis. 

Total protein was extracted from three of the DG44 cell 
lines that had been stably transfected with pSV2dhfr 
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containing the human neuronal calcium channel c* 2 subunit cDNA. 
Approximately 10 7 cells were sonicated in 300 /il of a solution 
containing 50 mM HEPES, 1 mM EDTA, 1 mM PMSF. An equal volume 
of 2x loading dye [Laemmli, U.K. (1970). Mature 227:680] was 
added to the samples and the protein was subjected to 
electrophoresis on an 8% polyacrylamide gel and then 
electrotransf erred to nitrocellulose. The nitrocellulose was 
incubated with polyclonal guinea pig antisera (1:200 dilution) 
directed against the rabbit skeletal muscle calcium channel a 2 
subunit (obtained from K. Campbell, University of Iowa) 
followed by incubation with [ 125 I] -protein A. The blot was 
exposed to X-ray film at -70° C. Reduced samples of protein 
from the transfected cells as well as from untransf ected DC344 
cells contained immunoreact i ve protein of the size expected 
for the a 2 subunit of the human neuronal calcium channel (130- 
150 kDa) . The level of this immunoreact ive protein was higher 
in 44a 2 -9 cells that had been grown in the presence of 10 mM 
sodium butyrate than in 44a 2 -9 cells that were grown in the 
absence of sodium butyrate. These data correlate well with 
those obtained in northern analyses of total RNA from 44a 2 -9 
and untransf ected DG44 cells. Cell line 44a 2 -9 also produced 
a 110 kD immunoreact ive protein that may be either a product 
of proteolytic degradation of the full-length a 2 subunit or a 
product of translation of one of the shorter (<5000 nt) mRNAs 
produced in this cell line that hybridized to the a 2 subunit 
cDNA probe. 

B. Expression of DNA encoding human 
neuronal calcium channel a x , a 2 and 0 X 
subunits in HEK cells 

Human embryonic kidney cells (HEK 293 cells) were 
transiently and stably transfected with human neuronal DNA 
encoding calcium channel subunits . Individual transf ectants 
were analyzed electrophysiologically for the presence of 
voltage-activated barium currents and functional recombinant 
voltage -dependent calcium channels were. 
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1. Transfection of HEK 293 cells 

Separate expression vectors containing DNA encoding human 
neuronal calcium channel a 1D , a 2 and fi x subunits, plasmids 
pVDCCIII(A), pHBCaCHa!^, and pHBCaCH^RBS (A) , respectively, 
were constructed as described in Examples II. A. 3, IV. B. and 
III.B.3., respectively. These three vectors were used to 
transiently co-transfect HEK 293 cells. For stable 

transfection of HEK 293 cells, vector pHBCaCH/3 lb RBS (A) (Example 
III.B.3.) was US ed in place of P HBCaCH/? la RBS (A) to introduce 
the DNA encoding the 0, subunit into the cells along with 
pVDCCIII (A) and pHBCaCHa 2 A. 

a. Transient transfection 
Expression vectors pVDCCIII (A) , pHBCaCHoijA and 
P HBCaCH0 la RBS(A) were used in two sets of transient 
transfections of HEK 293 cells (ATCC Accession No. CRL1573). 
In one transfection procedure, HEK 293 cells were transiently 
cotransfected with the a x subunit cDNA expression plasmid, the 
a 2 subunit cDNA expression plasmid, the /3, subunit cDNA 
expression plasmid and plasmid pCMV/3gal (Clontech 
Laboratories, Palo Alto, CA) . Plasmid pCMV/Jgal contains the 
lacz gene (encoding E. coli /5-galactosidase) fused to the 
cytomegalovirus (CMV) promoter and was included in this 
transfection as a marker gene for monitoring the efficiency of 
transfection. In the other transfection procedure, HEK 293 
cells were transiently co-transf ected with the ffl subunit cDNA 
expression plasmid pVDCCIII (A) and pCMV/Sgal . In both 
transfections, 2-4 x 10 6 HEK 293 cells in a 10-cm tissue 
culture plate were transiently co-transf ected with 5 /zg of 
each of the plasmids included in the experiment according to 
standard CaPO« precipitation transfection procedures (Wigler 
et al. (1979) Proc. Natl. Acad. Sci. USA 76:1373-1376). The 
transfectants were analyzed for jS-galactosidase expression by 
direct staining of the product of a reaction involving /3- 
galactosidase and the X-gal substrate [Jones, J.R. (1986) EMBO 
5:3133-3142] and by measurement of /3-galactosidase activity 
[Miller, J.H. (1972) Experiments in Molecular Genetics, pp. 
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352-355, Cold Spring Harbor Press] . To evaluate subunit cDNA 
expression in these transf ectants , the cells were analyzed for 
subunit transcript production (northern analysis) , subunit 
protein production (immunoblot analysis of cell lysates) and 
functional calcium channel expression (electrophysiological 
analysis) . 

b. Stable transf ection 

HEK 293 cells were transfected using the calcium 
phosphate transf ection procedure [Current Protocols in 
Molecular Biology, Vol. l, Wiley Inter- Science, Supplement 14, 
Unit 9.1.1-9.1.9 (1990)]. Ten-cm plates , each containing one- 
to-two million HEK 293 cells, were transfected with 1 ml of 
DNA/calcium phosphate precipitate containing 5 ng pVDCCIII (A) , 
5 ng P HBCaCHa 2 A, 5/xg P HBCaCH/3 lb RBS (A) , 5 pg pCMVBgal and 1 /ig 
pSV2neo (as a selectable marker) . After 10-20 days of growth 
in media containing 500 fig G418, colonies had formed and were 
isolated using cloning cylinders. 

2. Analysis of HEK 293 cells transiently 

transfected with DNA encoding human neuronal 

calcium channel subunite 

a. Analysis of jS-galactosidase expression 

Transient transf ectants were assayed for /?-galactosidase 
expression by /3-galactosidase activity assays (Miller, J.H., 

(1972) Experiments in Molecular Genetics, pp. 352-355, Cold 
Spring Harbor Press) of cell lysates (prepared as described in 
Example VII. A. 2) and staining of fixed cells (Jones, J.R. 

(1986) EMBO 5:3133-3142). The results of these assays 
indicated that approximately 30% of the HEK 293 cells had been 
transfected. 

b. Northern analysis 

PolyA+ RNA was isolated using the Invitrogen Fast Trak 
Kit (InVitrogen, San Diego, CA) from HEK 293 cells transiently 
transfected with DNA encoding each of the o lf oi 2 and I3 1 
subunits and the la.cZ gene or the a x subunit and the la.cZ 
gene. The RNA was subjected to electrophoresis on an agarose 
gel and transferred to nitrocellulose. The nitrocellulose was 
then hybridized with one or more of the following radiolabeled 
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probes: the lacZ gene, human neuronal calcium channel o 1D 
subunit -encoding cDNA, human neuronal calcium channel a 2 
subunit- encoding cDNA or human neuronal calcium channel p i 
subunit -encoding cDNA. Two transcripts that hybridized with 
the or, subunit -encoding cDNA were detected in HEK 293 cells 
transfected with the DNA encoding the a lt a 3 , and p l subunits 
and the lacZ gene as well as in HEK 293 cells transfected with 
the a, subunit cDNA and the lacZ gene. One mRNA species was 
the size expected for the transcript of the a, subunit cDNA 
(8000 nucleotides) . The second RNA species was smaller (4000 
nucleotides) than the size expected for this transcript. RNA 
of the size expected for the transcript of the lacZ gene was 
detected in cells transfected with the a lt a 2 and 0, subunit - 
encoding cDNA and the lacZ gene and in cells transfected with 
the a x subunit cDNA and the lacZ gene by hybridization to the 
lacZ gene sequence. 

RNA from cells transfected with the a lt at 2 and & subunit - 
encoding cDNA and the lacZ gene was also hybridized with the 
0f a and 0 l subunit cDNA probes . Two mRNA species hybridized to 
the a 2 subunit cDNA probe. One species was the size expected 
for the transcript of the oc 2 subunit cDNA (4000 nucleotides) . 
The other species was larger (6000 nucleotides) than the 
expected size of this transcript. Multiple RNA species in the 
cells co-transfected with or,, a 2 and /S x subunit -encoding cDNA 
and the lacZ gene hybridized to the 0 1 subunit cDNA probe. 
Multiple 0 subunit transcripts of varying sizes were produced 
since the 0 subunit cDNA expression vector contains two 
potential polyA' addition sites. 

c. Electrophysiological analysis 
Individual transiently transfected HEK 293 cells were 
assayed for the presence of volt age -dependent barium currents 
using the whole-cell variant of the patch clamp technique 
[Hamill et al . (1981). Pflugers Arch. 392:85-100]. HEK 293 
cells transiently transfected with pCMV/Sgal only were assayed 
for barium currents as a negative control in these 
experiments. The cells were placed in a bathing solution that 
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contained barium ions to serve as the current carrier. 
Choline chloride, instead of NaCl or KCl, was used as the 
major salt component of the bath solution to eliminate 
currents through sodium and potassium channels. The bathing 
solution contained 1 mM MgCl 2 and was buffered at pH 7.3 with 
10 mM HEPES (pH adjusted with sodium or tetraethylammonium 
hydroxide) . Patch pipettes were filled with a solution 
containing 135 mM CsCl, 1 mM MgCl 2< 10 mM glucose, 10 mM EGTA, 
4 mM ATP and 10 mM HEPES (pH adjusted to 7.3 with 
tetraethylammonium hydroxide) . Cesium and tetraethylammonium 
ions block most types of potassium channels. Pipettes were 
coated with Sylgard (Dow-Corning, Midland, MI) and had 
resistances of 1-4 megohm. Currents were measured through a 
500 megohm headstage resistor with the Axopatch IC (Axon 
instruments, Foster City, CA) amplifier, interfaced with a 
Labmaster (Scientific Solutions, Solon, OH) data acquisition 
board in an IBM- compatible PC. PClamp (Axon Instruments) was 
used to generate voltage commands and acquire data. Data were 
analyzed with pClamp or Quattro Professional (Borland 
International, Scotts Valley^CA^rograms . 

To apply drugs, "puffer" pipettes positioned within 
several micrometers of the cell under study were used to apply 
solutions by pressure application. The drugs used for 
pharmacological characterization were dissolved in a solution 
identical to the bathing solution. Samples of a 10 mM stock 
solution of Bay K 8644 (RBI, Natick, MA), which was prepared 
in DMSO, were diluted to a final concentration of 1 mM in 15 
mM Ba 2 *- containing bath solution before they were applied. 

Twenty-one negative control HEK 293 cells (transiently 
transfected with the lacZ gene expression vector pCMV/?gal 
only) were analyzed by the whole-cell variant of the patch 
clamp method for recording currents. Only one cell displayed 
a discernable inward barium current; this current was not 
affected by the presence of 1 ^ Bay K 8644. In addition, 
application of Bay K 8644 to four cells that did not display 
Ba 2 ' currents did not result in the appearance of any currents . 



BNSDOCID: <WO 9504822A1_I_= 



WO 95/04822 



PCTAJS94/09230 



-93- 



Two days after transient transfection of HEK 293 cells 
with a lf a 2 and 0 X subunit -encoding cDNA and the lacZ gene, 
individual transf ectants were assayed for volt age -dependent 
barium currents. The currents in nine transf ectants were 
recorded. Because the efficiency of transfection of one cell 
can vary from the efficiency of transfection of another cell, 
the degree of expression of heterologous proteins in 
individual transf ectants varies and some cells do not 
incorporate or express the foreign DNA. Inward barium 
currents were detected in two of these nine transf ectants . In 
these assays, the holding potential of the membrane was -90 
mV. The membrane was depolarized in a series of voltage steps 
to different test potentials and the current in the presence 
and absence of 1 <iM Bay K 8644 was recorded. The inward 
barium current was significantly enhanced in magnitude by the 
addition of Bay K 8644. The largest inward barium current 
(-160 pA) was recorded when the membrane was depolarized to 0 
mV in the presence of l ^ Bay K 8644. A comparison of the 
I-V curves, generated by plotting the largest current recorded 
after each depolarization versus the depolarization voltage, 
corresponding to recordings conducted in the absence and 
presence of Bay K 8644 illustrated the enhancement of the 
voltage-activated current in the presence of Bay K 8644. 

Pronounced tail currents were detected in the tracings 
of currents generated in the presence of Bay K 8644 in HEK 293 
cells transfected with a lt a 2 and 0 a subunit -encoding cDNA and 
the JacZ gene, indicating that the recombinant calcium 
channels responsible for the voltage- activated barium currents 
recorded in this transfected appear to be DHP-sensitive . 

The second of the two transfected cells that displayed 
inward barium currents expressed a -50 pA current when the 
membrane was depolarized from -90 mV. This current was nearly 
completely blocked by 200 M M cadmium, an established calcium 
channel blocker. 

Ten cells that were transiently transfected with the DNA 
encoding the a, subunit and the lacZ gene were analyzed by 
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whole-cell patch clamp methods two days after transfection 
One of these cells displayed a 30 pA inward barium current 
This current amplified 2-fold in the presence of l m Bay K 
8644. Furthermore, small tail currents were detected in the 
presence of Bay K 8644. These data indicate that expression 
of the human neuronal calcium channel a 1D subunit -encoding cDNA 
in HEK 293 yields a functional DHP-sensitive calcium channel. 

3 " w?ti yS ^° f ****? 92 CellS Stabl * transacted 
with DNA encoding human neuronal calcium 
channel sub units 

individual stably transfected HEK 293 cells were assayed 

electrophysiological^ for the presence of voltage -dependent 

barium currents as described for electrophysiological analysis 

of transiently transfected HEK 293 cells (see Example 

VXI.B.2.C) . In an effort to maximize calcium channel activity 

via cyclic-AMP-dependent kinase-mediated phosphorylation 

[Pelzer, et al . (1990) Kev. Physiol. Biochem. Pharmacol 

114:107-207], cAMP (Na salt, 250 m was added to the pipet 

solution and forskolin (10 fiM) was added to the bath solution 

m some of the recordings. Qualitatively similar results were 

obtained whether these compounds were present or not. 

Barium currents were recorded from stably transfected 

cells in the absence and presence of Bay K 8644 (i m m) when 

the cell was depolarized to -10 mV from a holding potential of 

-90 rnv in the absence of Bay K 8644, a current of 

approximately 35pA with a rapidly deactivating tail current 

was recorded. During application of Bay K 8644. an identical 

depolarizing protocol elicited a current of approximately 75 

PA, accompanied by an augmented and prolonged tail current. 

The peak magnitude of currents recorded from this same cell as 

a function of a series of depolarizing voltages were assessed. 

The responses in the presence of Bay K 8644 not only 

increased, but the entire current -voltage relation shifted 

about -10 mV. Thus, three typical hallmarks of Bay K 8644 

action, namely increased current magnitude, prolonged tail 

currents, and negatively shifted activation voltage, were 
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observed, clearly indicating the expression of a DHP-sensitive 
calcium channel in these stably transfected cells. No such 
effects of Bay K 8644 were observed in untransf ected HEK 293 
cells, either with or without cAMP or forskolin. 

C. Use of pCMV-based vectors and pcDNAl-based vectors 

for expression of DNA encoding human neuronal 

calcium channel subunits 

1. Preparation of constructs 
Additional expression vectors were constructed using 
pCMV. The full-length a lt> cDNA from pVDCCIII (A) (see Example 

II. A.3.d). the full-length a 2 cDNA, contained on a 3600 bp 
EcoRI fragment from HBCaCHa 2 (see Example IV. B) and a full- 
length P 1 subunit cDNA from pHBCaCH/3 lb RBS (A) (see Example 

III. B.3) were separately subcloned into plasmid pCMV/?gal . 
Plasmid pCMV0gal was digested with WotI to remove the la.cZ 
gene. The remaining vector portion of the plasmid, referred 
to as pCMV, was blunt-ended at the Wot I sites. The full- 
length a 2 -encoding DNA and ^-encoding DNA, contained on 
separate EcoRI fragments, were isolated, blunt-ended and 
separately ligated to the blunt -ended vector fragment of pCMV 
locating the cDNAs between the CMV promoter and SV4 0 
polyadenylation sites in pCMV. To ligate the a 1D -encoding cDNA 
with pCMV, the restriction sites in the polylinkers 
immediately 5' of the CMV promoter and immediately 3' of the 
SV4 0 polyadenylation site were removed from pCMV. A 
polylinker was added at the WotI site. The polylinker had the 
following sequence of restriction enzyme recognition sites: 
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EcoRI 


Sail 


PStI 


ECORV 


HindHl Xball 


site 


site 


site 


site 


site site 



GGCCGC 

CG Site sites site rup sH v«* Uiho I CACCGG 

I * 

Destroys Not 

The a 1D -encoding DNA, isolated as a BaitMI/XhoI fragment from 
PVDCCIIKA), was then ligated to Xball/Sall-digested pCMV to 
place it between the CMV promoter and SV4 0 polyadenylation 
site. 

Plasmid pCMV contains the CMV promoter as does pcDNAl, 
but differs from pcDNAl in the location of splice donor/splice 
acceptor sites relative to the inserted subunit- encoding DNA. 
After inserting the subunit -encoding DNA into pCMV, the splice 
donor/ splice acceptor sites are located 3' of the CMV promoter 
and 5' of the subunit -encoding DNA start codon. After 
inserting the subunit -encoding DNA into pcDNAl, the splice 
donor/splice acceptor sites are located 3' of the subunit cDNA 
stop codon. 

2. Transfection of HER 293 cells 

HEK 293 cells were . transiently co-transf ected with the 
QfiD/ Qf 3 and /3 a subunit -encoding DNA in pCMV or with the a 1D , a 2 
and 0 subunit -encoding DNA in pcDNAl (vectors pVDCCIII (A) , 
pHBCaCHor 2 A and pHBCaCH0 lb RBS (A) , respectively) , as described in 
Example VII.B.l.a. Plasmid pCMVjSgal was included in each 
transfection as a measure of transfection efficiency. The 
results of 0-galactosidase assays of the transf ectants (see 
Example VII. B. 2.), indicated that HEK 293 cells were 
transfected equally efficiently with pCMV- and pcDNAl-based 
plasmids. The pcDNAl -based plasmids, however, are presently 
preferred for expression of calcium channel receptors. 

D. Expression in Xenopus laevis oocytes of RNA 
encoding human neuronal calcium channel subunite 

Various combinations of the transcripts of DNA encoding 
the human neuronal a 1D , a 2 and subunits prepared in vitro 
were injected into Xenopus laevis oocytes. Those injected 
with combinations that included a 1D exhibited voltage -activated 
barium currents. 
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1. Preparation of transcripts 

Transcripts encoding the human neuronal calcium channel 
a 1D , a 2 and subunits were synthesized according to the 
instructions of the mCAP mRNA CAPPING KIT (Strategene, La 
Jolla, CA catalog #200350). Plasmids pVDCC III.RBS(A), 
containing pcDNAl and the Qf 1D cDNA that begins with a ribosome 
binding site and the eighth ATG codon of the coding sequence 
(see Example III. A. 3. d) , plasmid pHBCaCHc^A containing pcDNAl 
and an a 2 subunit cDNA (see Example IV) , and plasmid 
pHBCaCH/S lb RBS (A) containing pcDNAl and the ^ DNA lacking 
intron sequence and containing a ribosome binding site (see 
Example III), were linearized by restriction digestion. The 
or 1D cDNA- and ot 2 subunit -encoding plasmids were digested with 
XhoT, and the /3 a subunit- encoding plasmid was digested with 
EcoRV. The DNA insert was transcribed with T7 RNA polymerase. 

2. Injection of oocytes 

Xenopus laevis oocytes were isolated and def olliculated 
by collagenase treatment and maintained in 100 mM NaCl, 2 mM 
KC1, 1.8 mM CaCl 2 , 1 mM MgCl 2 , 5 mM HEPES , pH 7.6, 20 ng/ml 
ampicillin and 25 ng/ml streptomycin at 19-25°C for 2 to 5 
days after injection and prior to recording. For each 
transcript that was injected into the oocyte, 6 ng of the 
specific mRNA was injected per cell in a total volume of 50 
nl, 

3 . Intracellular voltage recordings 

Injected oocytes were examined for voltage -dependent 
barium currents using two-electrode voltage clamp methods 
[Dascal, N. (1987) CRC Crit . Rev. Biochem. 22:317] . The 
pClamp (Axon Instruments) software package was used in 
conjunction with a Labmaster 125 kHz data acquisition 
interface to generate voltage commands and to acquire and 
analyze data. Quattro Professional was also used in this 
analysis. Current signals were digitized at 1-5 kHz, and 
filtered appropriately. The bath solution contained of the 
following: 40 mM BaCl 2 , 36 mM tetraethylammonium chloride 
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(TEA-C1), 2 mM KC1, 5 mM 4 -aminopyridine , 0.15 mM niflumic 
acid, 5 mM HEPES, pH 7.6. 

a. Electrophysiological analysis of oocytes 
injected with transcripts encoding the 
human neuronal calcium channel a,, a, and 
sub units 

Uninjected oocytes were examined by two-electrode voltage 
clamp methods and a very small (25 nA) endogenous inward Ba 2 * 
current was detected in only one of seven analyzed cells. 

Oocytes coinjected with a 1B , a 2 and ft subunit transcripts 
expressed sustained inward barium currents upon depolarization 
of the membrane from a holding potential of -90 mV or -50 mV 
(154 + 129 nA, n=21) . These currents typically showed little 
inactivation when test pulses ranging from 140 to 700 msec, 
were administered. Depolarization to a series of voltages 
revealed currents that first appeared at approximately -30 mV 
and peaked at approximately 0 mV. 

Application of the DHP Bay K 8644 increased the magnitude 
of the currents, prolonged the tail currents present upon 
repolarization of the cell and induced a hyperpol arizing shift 
in current activation. Bay K 8644 was prepared fresh from a 
stock solution in DMSO and introduced as a lOx concentrate 
directly into the 6 0 pi bath while the perfusion pump was 
turned off. The DMSO concentration of the final diluted drug 
solutions in contact with the cell never exceeded 0.1%. 
Control experiments showed that 0.1% DMSO had no effect on 
membrane currents . 

Application of the DHP antagonist nifedipine (stock 
solution prepared in DMSO and applied to the cell as described 
for application of Bay K 8644) blocked a substantial fraction 
(91 ± 6%, n=7) of the inward barium current in oocytes 
coinjected with transcripts of the a lD , a 2 and ft subunit s . A 
residual inactivating component of the inward barium current 
typically remained after nifedipine application. The inward 
barium current was blocked completely by 50 fiM Cd 2 *, but only 
approximately 15% by 100 /iM Ni 2 *. 
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The effect of wCgTX on the inward barium currents in 
oocytes co- injected with transcripts of the a 1D , a 2 . and fi 1 
subunits was investigated. uCgTX (Bachem, Inc., Torrance CA) 
was prepared in the 15 mM BaCl 2 bath solution plus 0.1% 
cytochrome C (Sigma) to serve as a carrier protein. Control 
experiments showed that cytochrome C had no effect on 
currents. A series of voltage pulses from a -90 mV holding 
potential to 0 mV were recorded at 20 msec, intervals. To 
reduce the inhibition of uCgTX binding by divalent cations, 
recordings were made in 15 mM BaCl 2 , 73.5 mM 
tetraethylammonium chloride, and the remaining ingredients 
identical to the 40 mM Ba a * recording solution. Bay K 8644 was 
applied to the cell prior to addition to uCgTX in order to 
determine the effect of uCgTX on the DHP-sensitive current 
component that was distinguished by the prolonged tail 
currents. The inward barium current was blocked weakly (54 ± 
29%, n=7) and reversibly by relatively high concentrations 
(10-15 /xM) of uCgTX. The test currents and the accompanying 
tail currents were blocked progressively within two to three 
minutes after application of uCgTX, but both recovered 
partially as the uCgTX was flushed from the bath. 

b Analysis of oocytes injected with only 
a transcripts encoding the human 
neuronal calcium channel ex 1D or 
transcripts encoding an a 1D and other 
subunits 

The contribution of the a 2 and (3, subunits to the inward 
barium current in oocytes injected with transcripts encoding 
the a a 2 and 0, subunits was assessed by expression of the 
a D sutanit alone or in combination with either the ft subunit 
or the a 2 subunit. In oocytes injected with only the 
transcript of a «„ cDNA, no Ba" currents were detected (n=3) . 
in oocytes injected with transcripts of and fi 1 cDNAs , small 
(108 ± 39 nA) Ba 2 ' currents were detected upon depolarization 
of the membrane from a holding potential of -90 mV that 
resembled the currents observed in cells injected with 
transcripts of «„. c* 2 and ft cDNAs , although the magnitude of 
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the current was less. Ia two of the four o6cyfces 
with transcripts of the ^-encoding and ^-encoding DNA the 
Ba * currents exhibited a sensitivity to Bay k 8644 that was 
similar to the Bay K 8644 sensitivity of Ba- currents 
expressed in oocytes injected with transcripts encoding the * 1D 
Qfa-, <* 2 . and ft subunits. 

Three of five oocytes injected with transcripts encoding 
the «„ and a 2 subunits exhibited very small Ba- currents (15- 
30 nA) upon depolarization of the membrane from a holding 
potentaal of -90 mv. These barium currents showed little or 
no response to Bay K 8644. 

e. Analysis of oocytes injected with 
transcripts encoding the human neuronal 
calcium channel a 2 and/or ft subunit 

To evaluate the contribution of the * 1D ^-subunit to the 
inward barium currents detected in oocytes co-injected with 
transcripts encoding the «„. a 2 and ft subunits, oocytes 
^ected with transcripts encoding the human neuronal calcium 
channel a 2 and/or ft subunits were assayed for barium currents 
Oocytes injected with transcripts encoding the « 2 subunit 
displayed no detectable inward barium currents (n=5) 

TZT / njSCted With tranSCr iptS enCOdi ^ * ^unit 
displayed measurable (54 ± 23 nA, n=5) inward barium currents 
upon depolarization and oocytes injected with transcripts 
encoding the a 2 and ft subunits displayed inward barium 
currents that were approximately 50% larger (80 ± 61 nA n=18) 
than those detected in oocytes injected with transcripts of 
the ft -encoding DNA only. 

The inward barium currents in oocytes injected with 
transcripts encoding the ft subunit or a 2 and ft subunits 
typically were first observed when the membrane was 
depolarized to -30 mV from a holding potential of -90 mV and 
peaked when the membrane was depolarized to 10 to 20 mV 
Macroscopically, the currents in oocytes injected with 
transcripts encoding the a 2 and ft subunits or with transcripts 
encoding the ft subunit were indistinguishable. m contrast 
to the currents in oocytes co-injected with transcripts of a 
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a 2 and @ x subunit cDNAs, these currents showed a significant 
inactivation during the test pulse and a strong sensitivity to 
the holding potential. The inward barium currents in oocytes 
co- injected with transcripts encoding the a 2 and ^ subunits 
usually inactivated to 10-60% of the peak magnitude during a 
140-msec pulse and were significantly more sensitive to 
holding potential than those in oocytes co- injected with 
transcripts encoding the or 1D , a 2 and & subunits. Changing the 
holding potential of the membranes of oocytes co- injected with 
transcripts encoding the a 2 and /3 X subunits from -90 to -50 mV 
resulted in an approximately 81% (n=ll) reduction in the 
magnitude of the inward barium current of these cells. In 
contrast, the inward barium current measured in oocytes co- 
injected with transcripts encoding the a lT> , ot 2 and 0! subunits 
were reduced approximately 24% (n=ll) when the holding 
potential was changed from -90 to -50 mV. 

The inward barium currents detected in oocytes injected 
with transcripts encoding the or-, and subunits were 

pharmacologically distinct from those observed in oocytes co- 
injected with transcripts encoding the a 1D , a 2 and 0 X subunits. 
Oocytes injected with transcripts encoding the a 2 and & 
subunits displayed inward barium currents that were 
insensitive to Bay K 8644 (n=ll) . Nifedipine sensitivity was 
difficult to measure because of the holding potential 
sensitivity of nifedipine and the current observed in oocytes 
injected with transcripts encoding the a 2 and p t subunits. 
Nevertheless, two oocytes that were co- injected with 
transcripts encoding the a 2 and subunits displayed 

measurable (25 to 45 nA) inward barium currents when 
depolarized from a holding potential of -50 mV. These 
currents were insensitive to nifedipine (5 to 10 pM) . The 
inward barium currents in oocytes injected with transcripts 
encoding the a 2 and /Sj subunits showed the same sensitivity to 
heavy metals as the currents detected in oocytes injected with 
transcripts encoding the a 1D , otj and subunits. 
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The inward barium current detected in oocytes injected 
with transcripts encoding the human neuronal a 2 and fil subunits 
has pharmacological and biophysical properties that resemble 
calcium currents in uninfected Xenopus oocytes. Because the 
ammo acids of this human neuronal calcium channel fil subunit 
lack hydrophobic segments capable of forming transmembrane 
domains, it is unlikely that recombinant fi t subunits alone can 
form an ion channel. it is more probable that a homologous 
endogenous or, subunit exists in oocytes and that the activity 
mediated by such an subunit is enhanced by expression of a 
human neuronal 0 1 subunit. 

E * Xf™? i0n ° f DNA J e "°° ding human neuronal calcium 
channel of 1B , a n and 0^ subunits in HEK cells 

1. Transfection of HEK cells 

The transient expression of the human neuronal a 1B a 

and fil . a subunits was studied in HEK293 cells. The HEK293 

cells were grown as a monolayer culture in Dulbecco's modified 

Eagle's medium (Gibco) containing 5% defined- supplemented 

bovine calf serum (Hyclone) plus penicillin G (100 U/ml) and 

steptomycin sulfate (100 ^g/ml) . HEK293 cell transf ections 

were mediated by calcium phosphate as described above 

Transfected cells were examined for inward Ba 2 * currents (J ) 

mediated by volt age -dependent Ca 2 * channels. 

Cells were transfected (2 x io< per polylysine-coated 

Plate. standard transf ections (10-cm dish) contained 8 „g 

of pcDNAQf 1B .j , 5 „ g of pHBCaCHdjA, 2 „g P HBCaCH/J lb RBS (A) (see 

Examples II.A.3, IV.B. and III) and 2 M g of CMV0 (Clontech) fi- 

glactosidase expression plasmid, and pUClB to maintain a 

constant mass of 20 /xg/ml . Cells were analyzed 48 to 72 hours 

after transfection. Transfection efficiencies (±10%), which 

were determined by in situ histochemical staining for 

0-galactosidase activity (Sanes et al . (i 98 6) EMBO J., 

5.-3133), generally were greater than 50%. 

2. Electrophysiological analysis of transf ectant 
currents 

a. Materials and methods 
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Properties of recombinantly expressed Ca 2 * channels were 
studied by whole cell patch-clamp techniques. Recordings were 
performed on transfected HEK293 cells 2 to 3 days after 
transfection. Cells were plated at 100,000 to 300,000 cells 
per polylysine- coated, 35-mm tissue culture dishes (Falcon, 
Oxnard, CA) 24 hours before recordings. Cells were perfused 
with 15 mM BaCl 2 , '125 mM choline chloride, 1 mM MgCl 2 , and 
10 mM Hepes (pH = 7.3) adjusted with tetraethylammonium 
hydroxide (bath solution) . Pipettes were filled with 135 mM 
CsCl, 10 mM EGTA, 10 mM Hepes, 4 mM Mg-adenosine triphosphate 
(pH = 7.5) adjusted with tetraethylammonium hydroxide. 
Sylgard (Dow-Corning, Midland, MI) -coated, fire-polished, and 
filled pipettes had resistances of 1 to 2 megohm before gigohm 
seals were established to cells. 

Bay K 8644 and nifedipine (Research Biochemicals , Natick, 
MA) were prepared from stock solutions (in dimethyl sulfoxide) 
and diluted into the bath solution. The dimethyl sulfoxide 
concentration in the final drug solutions in contact with the 
cells never exceeded 0.1%. Control experiments showed that 
0.1% dimethyl sulfoxide had no efect on membrane currents. 
uCgTX (Bachem, Inc., Torrance CA) was prepared in the 15 mM 
BaCl 2 bath solution plus 0.1% cytochrome C (Sigma, St. Louis 
MO) to serve as a carrier protein. Control experiments 
showed that cytochrome C had no effect on currents. These 
drugs were dissolved in bath solution, and continuously 
applied by means of puffer pipettes as required for a given 
experiment. Recordings were performed at room temperature 
(22° to 25°C) . Series resistance compensation (70 to 85%) was 
employed to minimize voltage error that resulted from pipette 
access resistance, typically 2 to 3.5 megohm. Current signals 
were filtered (-3 dB, 4-pole Bessel) at a frequency of 1/4 to 
1/5 the sampling rate, which ranged from 0.5 to 3 kHz . 
Voltage commands were generated and data were acquired with 
CLAMPEX (pClamp, Axon Instruments, Foster City, CA) . All 
reported data are corrected for linear leak and capacitive 
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components. Exponential fitting of currents was performed 
with CLAMPFIT (Axon Instruments, Foster City, CA) . 
b. Results 

Transfectants were examined for inward Ba 2 * currents 
(J BB ), Cells cotransfected with DNA encoding o lB . lf a 2b , and 0, 2 
subunits expressed high-voltage-activated Ca 2 ' channels. i~* 
first appeared when the membrane" was depolarized from a 
holding potential of -90 mV to -20 mV and peaked in magnitude 
at 10 rnv. Thirty-nine of 95 cells (12 independent 

transfections) had J Ba that ranged from 30 to 2700 pA, with a 
mean of 433 pA. The mean current density was 26 pA/pF, and 
the highest density was 150 P A/pF. The J Ba typically increased 
by 2- to 20 -fold during the first 5 minutes of recording. 
Repeated depolarizations during long records often revealed 
rundown of J Ba usually not exceeding 20% within 10 min. J B 
typically activated within 10 ms and inactivated with both a 
fast time constant ranging from 46 to 105 ms and a slow time 
constant ranging from 291 to 453 ms (n = 3) . Inactivation 
showed a complex voltage dependence, such that J Ba elicited at 
*20 mV inactivated more slowly than j Ba elicited at lower test 
voltages, possibly a result of an increase in the magnitude of 
slow compared to fast inactivation components at higher test 
voltages. 

Recombinant or„. 1 a ab /S 1 . 2 channels were sensitive to holding 
potential. Steady-state inactivation of J Ba , measured after 
a 30- to 60-s conditioning at various holding potentials, was 
approximately 50% at holding potential between -60 and -70 mV 
and approximately 90% at -4 0 mV. Recovery of j Ba from 
inactivation was usually incomplete, measuring 55 to 75% of 
the original magnitude within l min. after the holding 
potential was returned to more negative potentials, possibly 
indicating some rundown or a slow recovery rate. 

Recombinant o 1B . l0 f 2b ^. 2 channels were also blocked 
irreversibly by u-CgTx concentrations ranging from 0.5 to 
10 nM during the time scale of the experiments. Application 
of 5 nM toxin (n = 7) blocked the activity completely within 
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2 min., and no recovery of J Ba was observed after washing 
w-CgTx from the bath for up to 15 min. d 2 * blockage (50 fiM) 
was rapid, complete, and reversible; the DHPs Bay K 8644 
(1 /iM; n = 4) or nifedipine (5 /iM; n = 3) had no discernable 
effect. 

Cells cotransfected with DNA encoding a 1B . lt <* 2b , and 
subunits predominantly displayed a single class of saturable, 
high-affinity co-CgTx binding sites. The determined 

dissociation constant (K^) value was 54.6 ± 14.5 pM (n = 4) . 
Cells transfected with the vector containing only 
/3-galactosidase-encoding DNA or a 2b /3- encoding DNA showed no 
specific binding. The binding capacity (B^J of the 
aiB-i<*2b0-transfected cells was 28,710 ± 11,950 sites per cell 
<n = 4) . 

These results demonstrate that a^B.jOrjb/Si.j-transf ected 
cells express high-voltage-activated, inactivating Ca 2 * channel 
activity that is irreversibly blocked by w-CgTx, insensitive 
to DHPs, and sensitive to holding potential. The activation 
and inactivation kinetics and voltage sensitivity of the 
channel formed in these cells are generally consistent with 
previous characterizations of neuronal N-type Ca 2+ channels. 

F Expression of DNA encoding human neuronal calcium 

channel a 1B . x , a lB _ 2 , a 2B , /S^ and /S^ subunits in HEK 

cells 

Significant Ba 2 * currents were not detected in 
untransfected HEK2 93 cells. Furthermore, untransf ected HEK293 
cells do not express detectable u-CgTx GVIA binding sites. 

In order to approximate the expression of a homogeneous 
population of trimeric a 1B , a 2b and & x protein complexes in 
transfected HEK293 cells, the a 1B , tx 2b and fi, expression levels 
were altered. The efficiency of expression and assembly of 
channel complexes at the cell surface were optimized by 
adjusting the molar ratio of cr lB , a 2b and fi l expression plasmids 
used in the transf ections . The transf ectants were analyzed 
for mRNA levels, w-CgTx GVIA binding and Ca 2 * channel current 
density in order to determine near optimal channel expression 
in the absence of immunological reagents for evaluating 
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protein expression. Higher molar ratios of <* 2b appeared to 
increase calcium channel activity. 
1. Transf ections 
HEK293 cells were maintained in DMEM (Gibco #320-l965AJ) 
5.5% Defined/Supplemented bovine calf serum (Hyclone #A-215i- 
D, 100 U/ml penicillin G and 100 /.g/ml streptomycin. Ca**- 
phosphate based transient transf ections were performed and 
analyzed as described above. Cells were co-transf ected with 
either 8 /ig pcDNAl^., (described in Example II. C) , 5 pg 
pHBCaCHofjA (see, Example IV.B.), 2 fig pHBCaCH/S lb RBS (A) (/?, 2 
expression plasmid; see Examples III. a. and IX. E.), and 2 iig 
pCMV/3-gal [Clontech, Palo Alto, CA) (2:1.8:1 molar ratio of 
Ca 2 * channel subunit expression plasmids) or with 3 ng 
pcDNAlo^., or pcDNAlo^.,, n. 2 5 M g pHB Ca CHo? 2 A , 0.75 or 1.0 ag 
pHBCaCH/J lb RBS(A) or pcDNAl/^ and 2 „g pCMV/3-gal (2:10.9:1 
molar ratio of Ca 2 * channel subunit expression plasmids) 
Plasmid pCMV/3-gal, a S-galactosidase expression plasmid, was 
included in the transf ections as a marker to permit 
transf ection efficiency estimates by histochemical staining. 
When less than three subunits were expressed, pCMVPL2, a pCMV 
promoter- containing vector that lacks a cDNA insert, was 
substituted to maintain equal moles of pCMV-based DNA in the 
transf ection. pUC18 DNA was used to maintain the total mass 
of DNA in the transf ection at 20 /zg/plate. 

RNA from the transfected cells was analyzed by Northern 
blot analysis for calcium channel subunit mRNA expression 
using random primed 32 P-labeled subunit specific probes 
HEK293 cells co-transf ected with a 2b and ft. a expression 

plasmids (8, 5 and 2 W , respectively; molar ratio = 2:1.8:1) 
did not express equivalent levels of each Ca- channel subunit 
mRNA. Relatively high levels of and fi x . a mRNAs were 

expressed, but significantly lower levels of a 2b mRNA were 
expressed. Based on autoradiograph exposures required to 
produce equivalent signals for all three mRNAs , a 2b transcript 
levels were estimated to be 5 to 10 times lower than a „ 
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(S x _ 2 transcript levels. Untransf ected HEK293 cells did not 
express detectable levels of a 1B . 1( a 2i> , or 0 X _ 2 mRNAs . 

To achieve equivalent Ca 2 ' channel subunit mRNA expression 
levels, a series of transf ections was performed with various 
amounts of a„. x . a 2b and expression plasmids . Because the 

«„ x and fi, 2 mRNAs were expressed at very high levels compared 
to a 2b mRNA, the mass of « 1M and B,., plasmids was lowered and 
the mass of a 2b plasmid was increased in the transfection 
experiments. Co- transfection with 3, 11.25 and 0.75 ng of «„. 

a 2b and expression plasmids, respectively (molar ratio 

= 2:10.9:1), approached equivalent expression levels of each 
Ca 2 * channel subunit mRNA. The relative molar quantity of a 2b 
expression plasmid to a lB . x and G x . 2 expression plasmids was 
increased 6-fold. The mass of and fi^ plasmids in the 

transfection was decreased 2.67-fold and the mass of a 2b 
plasmid was increased 2.25-fold. The 6-fold molar increase of 
a 2b relative to a lB . x and S x . 2 required to achieve near equal 
abundance mRNA levels is consistent with the previous 5- to 
10-fold lower estimate of relative a 2b mRNA abundance. w-CgTx 
GVIA binding to cells transfected with various amounts of 
expression plasmids indicated that the 3, 11.25 and 0.75 ^ of 
a B1 , a 2b and fi,. a plasmids, respectively, improved the level of 
cell surface expression of channel complexes. Further 
increases in the mass of a 2b and S x . 2 expression plasmids while 
a x was held constant, and alterations in the mass of the 
expression plasmid while a 2b and S x _ 2 were held constant, 
indicated that the cell surface expression of u-CgTx GVIA 
binding sites per cell was nearly optimal. All subsequent 
transf ections were performed with 3, 11.25 and 0.75 ,g or 1.0 
M g of or « 1M . a 2b and fi„ or 0 lO expression plasmids, 

respectively. 

2 " 5 I-a>-CgTx GVIA binding to transfected cells 

Statistical analysis of the K d and Bmax values was 
performed using one-way analysis of variance (ANOVA) followed 
by the Tukey-Kramer test for multiple pairwise comparisons 
(ps0.05) . 
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Combinations of human voltage -dependent Ca 2 ' channel 
subunits, «„. lf 0lB . 2> tt2b/ fiM and pi3i were analyzed for 
saturation binding of » 5 I- w - C gTx GVIA. About 200,000 cells 
were used per assay, except for the a 18 . lf ttlM , a 1B . l0 , 2b and a 1B 
2 <* 2b combinations which were assayed with l x 10 6 cells per tube" 
The transfected cells displayed a single-class of saturable 
high-affinity binding sites. The values for the dissociation 
constants (KJ and binding capacities < Bmax ) were determined for 
the different combinations. The results are summarized as 
follows : 



(sites/cell) 



Subunit Combination K<s (pM) 

54.9 ± H.i (n =4) 45,324 ± 15,606 

91,004 ± 37,654 



-i««A- 3 53.2 ± 3.6 (n=3) 

17.9 ± 1.9 (n=3) 
a iB-iPi. 3 17.9 ± 1.6 (n=3) 



5,756 ± 2,163 
8,729 ± 2,980 

°»-»«»b 84/6 ±15.3 (n=3) 2 ,256 ± 356 

3 l-7 ± 4.2 (n=3) 757 ± 128 

i B - 2 <* 2 b0i- 2 53.0 ± 4.8 (n=3) 19,371 ± 3,798 

ib-»««A. 3 44.3 ± 8.1 (n=3) 37,652 ± 8,129 

16.4 ± 1.2 (n-3) 2/ i 25 ± 412 

"i-^io 22.2 ± 5.8 (n-3) 2,944 ± 1,168 

"aB-z^b N.D.* (n=3) 



N.D. 



* N.D. = not^delectable N " D ' N - D - 

Cells transfected with subunit combinations lacking 
either the a u , or the a IB . 2 subunit did not exhibit any 
detectable 125 I-cu-CgTx GVIA binding U 600 sites/cell) . "*l-<„- 
CgTx GVIA binding to HEK293 cells transfected with a 1B _ 2 alone 
or a 1B . 2 Qf 2b was too low for reliable Scatchard analysis of the 
data. Comparison of the K, and B rax values revealed several 
relationships between specific combinations of subunits and 
the binding affinities and capacities of the transfected 
cells. in cells transfected with all three subunits, (a 1B . 
i« 2b ^- 2 -, owoWJxo-, QWWJ,.,-, or CY^c^/S^-transfectants) the 
K,, values were indistinguishable (p>0.05), ranging from 44.3 
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± 8.1 pM to 54.9 ± 11.1 pM. In cells transfected with two- 
subunit combinations lacking the a 2b subunit fa 1B . a Px. 2 , oi 1B . 1 /3 1 . 3 , 
aiB-2/Si- 2 or «iB-a/Si-3) the Kd values were significantly lower than 
the three-subunit combinations (p<0.01), ranging from 16.4 ± 
1.2 to 22.2 ± 5.8 pM. Cells transfected with only the 
subunit had a K d value of 31.7 + 4.2 pM, a value that was not 
different from the two-subunit combinations lacking a 2b 
(p<0.05). As with the comparison between the four a 1B a 2h P 1 
versus a lB 0j combinations, when the a 1B _! was co-expressed with 
a 2b , the K d increased significantly (p<0.05) from 31.7 ± 4.2 to 
84.6 ± 5.3 pM. These data demonstrate that co-expression of 
the a 2b subunit with et^-i* «i»-i0i- a . of«-i^i-3» ^is-aPi-a or a»-iPi-s 
subunit combinations results in lower binding affinity of the 
cell surface receptors for 12S I-u>-CgTx GVIA. The values 

of cells transfected with various subunit combinations also 
differed considerably. Cells transfected with the subunit 
alone expressed a low but detectable number of binding sites 
(approximately 750 binding sites/cell) . When the subunit 
was co-expressed with the a 2b subunit, the binding capacity 
increased approximately three-fold while co-expression of a fi x . 
2 or ^ subunit with a 1B . y resulted in B- to 10-fold higher 
expression of surface binding. Cells transfected with all 
three subunits expressed the highest number of cell surface 
receptors. The binding capacities of cells transfected with 
a lB . i a lb 0 1 . 3 or a 1B . 2 a ab /S a _ 3 combinations were approximately two- fold 
higher than the corresponding combinations containing the 0^ 
subunit. Likewise, cells transfected with a^ct^.t or 
a 1B . 1 a ab |B l _3 combinations expressed approximately 2.5-fold more 
binding sites per cell than the corresponding combinations con- 
taining o lB . 2 . In all cases, co-expression of the a 2b 
subunit with a 1B and /3, increased the surface receptor density 
compared to cells transfected with only the corresponding o„ and 
& combinations; approximately 8-fold for a^a^^. 10-fold for 
«».i«2b0i->. 9-fold for a^a^fi^. and 13-fold for a„. a « a A.,. Thus, 
comparison of the values suggests that the toxin-binding 

subunit, a„, or a lB .„ is more efficiently expressed and 
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assembled on the cell surface when co-ex-pressed with either the 
Qf 2b or the ft., or /3 10 subunit, and most efficiently expressed 
when a 2b and p 1 subunits are present. 
3 . Eleetrophysiology 

Functional expression of a SM _ 1 a al fi 1 . i and n ll . I g 1 . J subunit 
combinations was evaluated using the whole-cell recording 
technique. Transfected cells that had no contacts with 
surrounding cells and simple morphology were used approximately 
4 8 hours after transfection for recording. The pipette 
solution was (in mM) 135 CsCl, 10 EGTA, 1 MgCl 2 , 10 HEPES , and 
4 mM Mg-ATP (pH 7.3, adjusted with TEA-OH) . The external 
solution was (in mM) 15 BaCl 2/ 125 Choline Cl, 1 MgCl 2 , and 10 
HEPES (pH 7.3, adjusted with TEA-OH) . w-CgTx GVIA (Bachem) was 
prepared in the external solution with 0.1% cytochrome C 
(Sigma) to serve as a carrier. Control experiments showed that 
cytochrome C had no effect on the Ba** current. 

The macroscopic electrophysiological properties of Ba 2 * 
currents in cells transfected with various amounts of the a 3b 
expression plasmid with the relative amounts of and 0^ 

plasmids held constant were examined. The amplitudes and 
densities of the Ba 2 ' currents (15 mM BaCl 2 ) recorded from whole 
cells of these transf ectants differed dramatically. The 
average currents from 7 to n cells of three types of 
transf ections (no or 2b ; 2:1.8:1 [ a »., molar ratio; and 
2:10.9:1 [o aB . 1 :flr ab :fl 1 . a ] molar ratio) were determined. The 
smallest currents (range: 10 to 205 pA) were recorded when a 2t) 
was not included in the transfection, and the largest currents 
(range: 50 to 8300 pA) were recorded with the 2:10.9:1 ratio of 
«iB-i«ab0i- a Plasmids, the ratio that resulted in near equivalent 
mRNA levels for each subunit transcript . When the amount of a 2b 
plasmid was adjusted to yield approximately an equal abundance 
of subunit mRNAs , the average peak Ba s - current increased from 
433 pA to 1,824 pA (4.2-fold) with a corresponding increase in 
average current density from 26 pA/pF to 127 pA/pF (4.9-fold). 
This increase is in the presence of a 2.7-fold decrease in the 
mass of of^.j and expression plasmids in the transf ections . 
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In all transf ections , the magnitudes of the Ba 24 currents did 
not follow a normal distribution. 

To compare the subunit combinations and determine the 
effects of Qf Jb/ the current -voltage properties of cells 
transf ected with a^.^.2 or with a 1B . 1 a 2b S 1 . 2 in either the 2:1.8:1 
(aiB-i :fl, 2b:0i-a) molar ratio or the 2:10.9:1 {a lB _,. :a 2b : P^) molar 
ratio transf ectants were examined. The extreme examples of no 
a 2b and 11.25 /zg a 2b (2:10.9:1 molar ratio) showed no significant 
differences in the current voltage plot at test potentials 
between 0 mV and +40 mV (p<0.05). The slight differences 
observed at either side of the peak region of the current 
voltage plot were likely due to normalization. The very small 
currents observed in the a^.^., transfected cells have a 
substantially higher component of residual leak relative to the 
barium current that is activated by the test pulse. When the 
current voltage plots are normalized, this leak is a much 
greater component than in the a^oiibPi-i transfected cells and 
as a result, the current -voltage plot appears broader. This is 
the most likely explanation of the apparent differences in the 
current voltage plots, especially given the fact that the 
current-voltage plot for the a lB . 1 P 1 . 2 transfected cells diverge 
on both sides of the peak. Typically, when the voltage- 
dependence activation is shifted, the entire current -voltage 
plot is shifted, which was not observed. To qualitatively 
compare the kinetics of each, the average responses of test 
pulses from -90 mV to 10 mV were normalized and plotted. No 
significant differences in activation or inactivation kinetics 
of whole-cell Ba 2 * currents were observed with any combination. 

G. Expression of DNA encoding human neuronal calcium 
channel a lE - 3 a aB0i-3 and ctM.&vPi-i subunits in HEK cells 

Functional expression of the a 1E . 1 a 2B P 1 ^ and a lB . 3 a 2B 0 1 . 3 , as 
well as a 1E _ 3 was evaluated using the whole cell recording 
technique . 

1 . Methods 



WO 95/04822 



PCT/US94/09230 



-112- 



Recordings were performed on transiently transfected HEK 
293 cells two days following the transf ection, from cells that 
had no contacts with surrounding cells and which had simple 
morphology. 

The internal solution used to fill pipettes for recording 
the barium current from the transfected recombinant calcium 
channels was (in mM) 135 CsCl, 10 EGTA, 1 MgCl 2 , 10 HEPES, and 

4 mM Mg-ATP (pH 7.4-7.5, adjusted with TEA-OH) . The external 
solution for recording the barium current was (in mM) 15 BaCl 2 , 
150 Choline Cl , 1 MgCl 2 , and 10 HEPES and 5 TEA-OH (pH 7.3, 
adjusted with TEA-OH) . in experiments in which Ca J+ was 
replaced for Ba 2 \ a Laminar flow chamber was used in order to 
completely exchange the extracellular solution and prevent any 
mixing of Ba 2 * and Ca 2+ . w-CgTx GVIA was prepared in the 
external solution with 0.1% cytochrome C to serve as a carrier, 
the toxin was applied by pressurized puffer pipette. Series 
resistance was compensated 70-85% and currents were analyzed 
only if the voltage error from series resistance was less than 

5 mV. Leak resistance and capacitance was corrected by 
subtracting the scaled current observed with the P/-4 protocol 
as implemented by pClamp (Axon Instruments) . 

2. Electrophysiology Results 

Cells transfected with ot^a^^ or cr 1E _,a ab |8 lo showed strong 
barium currents with whole cell patch clamp recordings. Cells 
expressing a 1E _ 3 a 21 A- 3 had larger peak currents than those 
expressing a 1B . 1 a ah /B 1 ., . In addition, the kinetics of activation 
and inactivation are clearly substantially faster in the cells 
expressing a 1E calcium channels. HEK 293 cells expressing cr 1E _ 3 
alone have a significant degree of functional calcium channels, 
with properties similar to those expressing a^o^/J^ but with 
substantially smaller peak barium currents. Thus, with a 1E , the 
a 2 and ^ 1 subunits are not reguired for functional expression of 
ot 1E mediated calcium channels, but do substantially increase the 
number of functional calcium channels. 

Examination of the current voltage properties of a^a^P^ 3 
expressing cells indicates that a 1E . 3 o- 2b ,Si-3 is a high-voltage 
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activated calcium channel and the peak current is reached at a 
potential only slightly less positive than other neuronal 
calcium channels also expressing a 2b and (3 lt and a 1B and a 1D . 
Current voltage properties of ot 1B . 1 a 2b P 1 . 3 and otiz-3 a 2bPi-3 are 
statistically different from those of a 1B _ 1 a ab /J 1 . 3 . Current 
voltage curves for a^a^P^ and o 1E . 3 o 3b (8 1 . s peak at approximately 
+5mV, as does the current voltage curve for a 1E . 3 alone. 

The kinetics and voltage dependence of inactivation using 
both prepulse (200 ms) and steady-state inactivation was 
examined. a 1E mediated calcium channels are rapidly inactivated 
relative to previously cloned calcium channels and other high 
voltage-activated calcium channels. otiE-i^hPi-i mediated calcium 
channels are inactivated rapidly and are thus sensitive to 
relatively brief (200 ms) prepulses as well as long prepulses 
(>20s steady state inactivation), but recover rapidly from 
steady state inactivation. The kinetics of the rapid 
inactivation has two components, one with a time constant of 
approximately 25 ms and the other approximately 400 ms. 

To determine whether a 1B mediated calcium channels have 
properties of low voltage activated calcium channels, the 
details of tail currents activated by a test pulse ranging -60 
to +90 mv were measured at -60 mV. Tail currents recorded at - 
60 mV could be well fit by a single exponential of 150 to 300 
Us; at least an order of magnitude faster than those typically 
observed with low voltage- activated calcium channels. 

HEK 293 cells expressing a^jOr^-a flux more current with 
Ba 2 * as the charge carrier and currents carried by Ba 2+ and Ca 2+ 
have different current -voltage properties. Furthermore, the 
time course of inactivation is slower and the amount of 
prepulse inactivation less with Ca 2+ as the charge carrier. 

While the invention has been described with some 
specificity, modifications apparent to those with ordinary 
skill in the art may be made without departing from the scope 
of the invention. Since such modifications will be apparent to 
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those of skill in the art, it is intended that this invention 
be limited only by the scope of the appended claims. 



WO 95/04822 



PCT/US94/09230 



-115- 



SEQUENCE LISTING 

(1) GENERAL INFORMATION: 
(i) APPLICANT: 

(A) NAME: THE SALK INSTITUTE BIOTECHNOLY/INDUSTRIAL ASSOCIATES 

(B) STREET: 505 COAST BLVD SOUTH, SUITE 3 00 

(C) CITY: La Jolla 

(D) STATE: California 

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP) : 92037 



(ii) TITLE OF INVENTION: HUMAN CALCIUM CHANNEL COMPOSITIONS AND 
METHODS 

(iii) NUMBER OF SEQUENCES: 38 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(Vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/149,097 

(B) FILING DATE: 5-NOV-1993 

(Vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/105,536 

(B) FILING DATE: ll-AUG-1993 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7635 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 511.. 6996 

(ix) FEATURE: 

(A) NAME/KEY: 5 ' UTR 

(B) LOCATION: 1. .510 

(ix) FEATURE: 
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(A) NAME/KEY: 3'UTR 

(B) LOCATION: 6994 .. 7635 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

GGGCGAGCGC CTCCGTCCCC GGATGTGAGC TCCGGCTGCC CGCGGTCCCG AGCCAGCGGC 60 

GCGCGGGCGG CGGCGGCGGG CACCGGGCAC CGCGGCGGGC GGGCAGACGG GCGGGCATGG 12 0 

GGGGAGCGCC GAGCGGCCCC GGCGGCCGGG CCGGCATCAC CGCGGCGTCT CTCCGCTAGA 180 

GGAGGGGACA AGCCAGTTCT CCTTTGCAGC AAAAAATTAC ATGTATATAT TATTAAGATA 24 0 

ATATATACAT TGGATTTTAT TTTTTTAAAA AGTTTATTTT GCTCCATTTT TGAAAAAGAG 300 

AGAGCTTGGG TGGCGAGCGG TTTTTTTTTA AAATCAATTA TCCTTATTTT CTGTTATTTG 360 

TCCCCGTCCC TCCCCACCCC CCTGCTGAAG CGAGAATAAG GGCAGGGACC GCGGCTCCTA 42 0 

CCTCTTGGTG ATCCCCTTCC CCATTCCGCC CCCGCCCCAA CGCCCAGCAC AGTGCCCTGC 480 

ACACAGTAGT CGCTCAATAA ATGTTCGTGG ATG ATG ATG ATG ATG ATG ATG AAA 534 
Met Met Met Met Met Met Met Lys 
1 5 

AAA ATG CAG CAT CAA CGG CAG CAG CAA GCG GAC CAC GCG AAC GAG GCA 582 
Lys Met Gin His Gin Arg Gin Gin Gin Ala Asp His Ala Asn Glu Ala 
10 15 20 

AAC TAT GCA AGA GGC ACC AGA CTT CCT CTT TCT GGT GAA GGA CCA ACT 630 
Asn Tyr Ala Arg Gly Thr Arg Leu Pro Leu Ser Gly Glu Gly Pro Thr 
25 30 35 ' 40 

TCT CAG CCG AAT AGC TCC AAG CAA ACT GTC CTG TCT TGG CAA GCT GCA 678 
Ser Gin Pro Asn Ser Ser Lys Gin Thr Val Leu Ser Trp Gin Ala Ala 
45 50 55 

ATC GAT GCT GCT AGA CAG GCC AAG GCT GCC CAA ACT ATG AGC ACC TCT 726 
lie Asp Ala Ala Arg Gin Ala Lys Ala Ala Gin Thr Met Ser Thr Ser 
60 65 70 

GCA CCC CCA CCT GTA GGA TCT CTC TCC CAA AGA AAA CGT CAG CAA TAC 774 
Ala Pro Pro Pro Val Gly Ser Leu Ser Gin Arg Lys Arg Gin Gin Tyr 
75 80 85 

GCC AAG AGC AAA AAA CAG GGT AAC TCG TCC AAC AGC CGA CCT GCC CGC 822 
Ala Lys Ser Lys Lys Gin Gly Asn Ser Ser Asn Ser Arg Pro Ala Arg 
90 95 100 

GCC CTT TTC TGT TTA TCA CTC AAT AAC CCC ATC CGA AGA GCC TGC ATT 870 
Ala Leu Phe Cys Leu Ser Leu Asn Asn Pro lie Arg Arg Ala Cvs He 
105 no us ' 120 

AGT ATA GTG GAA TGG AAA CCA TTT GAC ATA TTT ATA TTA TTG GCT ATT 918 
Ser He Val Glu Trp Lys Pro Phe Asp He Phe He Leu Leu Ala He 
125 130 135 
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TTT GCC AAT TGT GTG GCC TTA GCT ATT TAC ATC CCA TTC CCT GAA GAT 
Phe Ala Asn Cys Val Ala Leu Ala He Tyr He Pro Phe Pro Glu Asp 
140 145 150 

GAT TCT AAT TCA ACA AAT CAT AAC TTG GAA AAA GTA GAA TAT GCC TTC 
Asp Ser Asn Ser Thr Asn His Asn Leu Glu Lys Val Glu Tyr Ala Phe 
155 160 165 

CTG ATT ATT TTT ACA GTC GAG ACA TTT TTG AAG ATT ATA GCG TAT GGA 
Leu He He Phe Thr Val Glu Thr Phe Leu Lys He He Ala Tyr Gly 
170 175 180 

TTA TTG CTA CAT CCT AAT GCT TAT GTT AGG AAT GGA TGG AAT TTA CTG 
Leu Leu Leu His Pro Asn Ala Tyr Val Arg Asn Gly Trp Asn Leu Leu 
185 190 195 200 

GAT TTT GTT ATA GTA ATA GTA GGA TTG TTT AGT GTA ATT TTG GAA CAA 
Asp Phe Val He Val He Val Gly Leu Phe Ser Val He Leu Glu Gin 
205 210 215 

TTA ACC AAA GAA ACA GAA GGC GGG AAC CAC TCA AGC GGC AAA TCT GGA 
Leu Thr Lys Glu Thr Glu Gly Gly Asn His Ser Ser Gly Lys Ser Gly 
220 225 230 

GGC TTT GAT GTC AAA GCC CTC CGT GCC TTT CGA GTG TTG CGA CCA CTT 
Gly Phe Asp Val Lys Ala Leu Arg Ala Phe Arg Val Leu Arg Pro Leu 
235 240 245 

CGA CTA GTG TCA GGA GTG CCC AGT TTA CAA GTT GTC CTG AAC TCC ATT 
Arg Leu Val Ser Gly Val Pro Ser Leu Gin Val Val Leu Asn Ser He 
250 255 260 

ATA AAA GCC ATG GTT CCC CTC CTT CAC ATA GCC CTT TTG GTA TTA TTT 
He Lys Ala Met Val Pro Leu Leu His He Ala Leu Leu Val Leu Phe 
265 270 275 280 

GTA ATC ATA ATC TAT GCT ATT ATA GGA TTG GAA CTT TTT ATT GGA AAA 
Val He He He Tyr Ala He He Gly Leu Glu Leu Phe He Gly Lys 
285 290 295 

ATG CAC AAA ACA TGT TTT TTT GCT GAC TCA GAT ATC GTA GCT GAA GAG 
Met His Lys Thr Cys Phe Phe Ala Asp Ser Asp He Val Ala Glu Glu 
300 305 310 

GAC CCA GCT CCA TGT GCG TTC TCA GGG AAT GGA CGC CAG TGT ACT GCC 
Asp Pro Ala Pro Cys Ala Phe Ser Gly Asn Gly Arg Gin Cys Thr Ala 
315 " 320 325 

AAT GGC ACG GAA TGT AGG AGT GGC TGG GTT GGC CCG AAC GGA GGC ATC 
Asn Gly Thr Glu Cys Arg Ser Gly Trp Val Gly Pro Asn Gly Gly He 
330 335 340 

ACC AAC TTT GAT AAC TTT GCC TTT GCC ATG CTT ACT GTG TTT CAG TGC 
Thr Asn Phe Asp Asn Phe Ala Phe Ala Met Leu Thr Val Phe Gin Cys 
345 350 355 360 
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iTf Thr «I? S? C l GG A S A GAC GTG CTC TAC TGG ATG ** T GAT Q CT 

He Thr Met Glu Gly Trp Thr Asp Val Leu Tyr Trp Met Asn Asp Ala 
365 370 375 

tl G ZZ T TTG CCC TGG GTG TAT GTC AGT CT C GTC ATC TTT 

Met Gly Phe Glu Leu Pro Trp Val Tyr Phe Val Ser Leu Val He Phe 
380 385 390 

GGG' TCA TTT TTC GTA CTA AAT CTT GTA CTT GGT GTA TTG AGC GGA GAA 
Gly ser Phe Phe Val Leu Asn Leu Val Leu Gly Val Leu Ser Gly Glu 
395 400 405 

TTC TCA AAG GAA AGA GAG AAG GCA AAA GCA CGG GGA GAT TTC CAG AAG 
Phe Ser Lys Glu Arg Glu Lys Ala Lys Ala Arg Gly Asp Phe Gin Lys 
410 415 420 

CTC CGG GAG AAG CAG CAG CTG GAG GAG GAT CTA AAG GGC TAC TTG GAT 
Leu Arg Glu Lys Gin Gin Leu Glu Glu Asp Leu Lys Gly Tyr Leu Asp 
425 430 435 440 

TGG ATC ACC CAA GCT GAG GAC ATC GAT CCG GAG AAT GAG GAA GAA GGA 
Trp He Thr Gin Ala Glu Asp He Asp Pro Glu Asn Glu Glu Glu Gly 
445 450 455 

S? A 2 AG GGC *** CGA ** T ACT AGC ATG ccc A CC AGC GAG ACT GAG 

Gly Glu Glu Gly Lys Arg Asn Thr Ser Met Pro Thr Ser Glu Thr Glu 
460 465 470 

TCT GTG AAC ACA GAG AAC GTC AGC GGT GAA GGC GAG AAC CGA GGC TGC 
Ser Val Asn Thr Glu Asn Val Ser Gly Glu Gly Glu Asn Arg Gly Cvs 
475 480 485 

TGT GGA AGT CTC TGT CAA GCC ATC TCA AAA TCC AAA CTC AGC CGA CGC 
Cys Gly Ser Leu Cys Gin Ala He Ser Lys Ser Lys Leu Ser Arg Arg 
490 495 500 

TGG CGT CGC TGG AAC CGA TTC AAT CGC AGA AGA TGT AGG GCC GCC GTG 
Trp Arg Arg Trp Asn Arg Phe Asn Arg Arg Arg Cys Arg Ala Ala Val 
505 sio 515 520 

AAG TCT GTC ACG TTT TAC TGG CTG GTT ATC GTC CTG GTG TTT CTG AAC 
Lys Ser Val Thr Phe Tyr Trp Leu Val He Val Leu Val Phe Leu Asn 
525 530 535 

ACC TTA ACC ATT TCC TCT GAG CAC TAC AAT CAG CCA GAT TGG TTG ACA 
Thr Leu Thr He Ser Ser Glu His Tyr Asn Gin Pro Asp Trp Leu Thr 
540 545 550 

CAG ATT CAA GAT ATT GCC AAC AAA GTC CTC TTG GCT CTG TTC ACC TGC 
Gin He Gin Asp He Ala Asn Lys Val Leu Leu Ala Leu Phe Thr Cys 
555 560 565 

GAG ATG CTG GTA AAA ATG TAC AGC TTG GGC CTC CAA GCA TAT TTC GTC 
Glu Met Leu Val Lys Met Tyr Ser Leu Gly Leu Gin Ala Tyr Phe Val 
570 575 580 
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TCT CTT TTC AAC CGG TTT GAT TGC TTC GTG GTG TGT GGT GGA ATC ACT 
Ser Leu Phe Asn Arg Phe Asp Cys Phe Val Val Cys Gly Gly He Thr 
585 590 595 600 

GAG ACG ATC TTG GTG GAA CTG GAA ATC ATG TCT CCC CTG GGG ATC TCT 
Glu Thr He Leu Val Glu Leu Glu He Met Ser Pro Leu Gly He Ser 
605 610 615 

GTG TTT CGG TGT GTG CGC CTC TTA AGA ATC TTC AAA GTG ACC AGG CAC 
Val Phe Arg Cys Val Arg Leu Leu Arg He Phe Lys Val Thr Arg His 
620 625 630 

TGG ACT TCC CTG AGC AAC TTA GTG GCA TCC TTA TTA AAC TCC ATG AAG 
Trp Thr Ser Leu Ser Asn Leu Val Ala Ser Leu Leu Asn Ser Met Lys 
635 640 645 

TCC ATC GCT TCG CTG TTG CTT CTG CTT TTT CTC TTC ATT ATC ATC TTT 
Ser He Ala Ser Leu Leu Leu Leu Leu Phe Leu Phe He He He Phe 
650 655 660 

TCC TTG CTT GGG ATG CAG CTG TTT GGC GGC AAG TTT AAT TTT GAT GAA 
Ser Leu Leu Gly Met Gin Leu Phe Gly Gly Lys Phe Asn Phe Asp Glu 
665 670 675 680 

ACG CAA ACC AAG CGG AGC ACC TTT GAC AAT TTC CCT CAA GCA CTT CTC 
Thr Gin Thr Lys Arg Ser Thr Phe Asp Asn Phe Pro Gin Ala Leu Leu 
685 690 695 

ACA GTG TTC CAG ATC CTG ACA GGC GAA GAC TGG AAT GCT GTG ATG TAC 
Thr Val Phe Gin He Leu Thr Gly Glu Asp Trp Asn Ala Val Met Tyr 
700 705 710 

GAT GGC ATC ATG GCT TAC GGG GGC CCA TCC TCT TCA GGA ATG ATC GTC 
Asp Gly He Met Ala Tyr Gly Gly Pro Ser Ser Ser Gly Met He Val 
715 720 725 

TGC ATC TAC TTC ATC ATC CTC TTC ATT TGT GGT AAC TAT ATT CTA CTG 
Cys He Tyr Phe He He Leu Phe He Cys Gly Asn Tyr He Leu Leu 
730 735 740 

AAT GTC TTC TTG GCC ATC GCT GTA GAC AAT TTG GCT GAT GCT GAA AGT 
Asn Val Phe Leu Ala He Ala Val Asp Asn Leu Ala Asp Ala Glu Ser 
745 750 755 760 

CTG AAC ACT GCT CAG AAA GAA GAA GCG GAA GAA AAG GAG AGG AAA AAG 
Leu Asn Thr Ala Gin Lys Glu Glu Ala Glu Glu Lys Glu Arg Lys Lys 
765 770 775 

ATT GCC AGA AAA GAG AGC CTA GAA AAT AAA AAG AAC AAC AAA CCA GAA 
He Ala Arg Lys Glu Ser Leu Glu Asn Lys Lys Asn Asn Lys Pro Glu 
780 785 790 

GTC AAC CAG ATA GCC AAC AGT GAC AAC AAG GTT ACA ATT GAT GAC TAT 
Val Asn Gin He Ala Asn Ser Asp Asn Lys Val Thr He Asp Asp Tyr 
795 800 805 
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AGA GAA GAG GAT GAA GAC AAG GAC CCC TAT CCG CCT TGC GAT GTG CCA 
Arg Glu Glu Asp Glu Asp Lys Asp Pro Tyr Pro Pro Cys Asp Val Pro 
810 815 820 

GTA GGG GAA GAG GAA GAG GAA GAG GAG GAG GAT GAA CCT GAG GTT CCT 
Val Gly Glu Glu Glu Glu Glu Glu Glu Glu Asp Glu Pro Glu Val Pro 
825 830 835 840 

GCC GGA CCC CGT CCT CGA AGG ATC TCG GAG TTG AAC ATG AAG GAA AAA 
Ala Gly Pro Arg Pro Arg Arg lie Ser Glu Leu Asn Met Lys Glu Lys 
845 850 855 

ATT GCC CCC ATC CCT GAA GGG AGC GCT TTC TTC ATT CTT AGC AAG ACC 
lie Ala Pro He Pro Glu Gly Ser Ala Phe Phe He Leu Ser Lys Thr 
860 865 870 

AAC CCG ATC CGC GTA GGC TGC CAC AAG CTC ATC AAC CAC CAC ATC TTC 
Asn Pro He Arg Val Gly Cys His Lys Leu He Asn His His He Phe 
8? 5 880 885 

ACC AAC CTC ATC CTT GTC TTC ATC ATG CTG AGC AGT GCT GCC CTG GCC 
Thr Asn Leu He Leu Val Phe He Met Leu ser ser Ala Ala Leu Ala 
890 895 900 

GCA GAG GAC CCC ATC CGC AGC CAC TCC TTC C6G AAC ACG ATA CTG GGT 
Ala Glu Asp Pro He Arg Ser His Ser Phe Arg Asn Thr He Leu Gly 
905 910 915 920 

TAC TTT GAC TAT GCC TTC ACA GCC ATC TTT ACT GTT GAG ATC CTG TTG 
Tyr Phe Asp Tyr Ala Phe Thr Ala He Phe Thr Val Glu He Leu Leu 
925 930 935 

AAG ATG ACA ACT TTT GGA GCT TTC CTC CAC AAA GGG GCC TTC TGC AGG 
Lys Met Thr Thr Phe Gly Ala Phe Leu His Lys Gly Ala Phe Cys Arg 
940 945 9 50 

AAC TAC TTC AAT TTG CTG GAT ATG CTG GTG GTT GGG GTG TCT CTG GTG 
Asn Tyr Phe Asn Leu Leu Asp Met Leu Val Val Gly Val Ser Leu Val 
955 960 965 

TCA TTT GGG ATT CAA TCC AGT GCC ATC TCC GTT GTG AAG ATT CTG AGG 
Ser Phe Gly He Gin Ser Ser Ala He Ser Val Val Lys He Leu Arg 
970 975 980 

GTC TTA AGG GTC CTG CGT CCC CTC AGG GCC ATC AAC AGA GCA AAA GGA 
Val Leu Arg Val Leu Arg Pro Leu Arg Ala He Asn Arg Ala Lys Glv 
985 990 995 i 0 00 

CTT AAG CAC GTG GTC CAG TGC GTC TTC GTG GCC ATC CGG ACC ATC GGC 
Leu Lys His Val Val Gin Cys Val Phe Val Ala He Arg Thr He Gly 
1005 1010 1015 

AAC ATC ATG ATC GTC ACC ACC CTC CTG CAG TTC ATG TTT GCC TGT ATC 
Asn He Met He Val Thr Thr Leu Leu Gin Phe Met Phe Ala Cys He 
1020 1025 1030 
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GGG GTC CAG TTG TTC AAG GGG AAG TTC TAT CGC TGT ACG GAT GAA GCC 3654 
Gly Val Gin Leu Phe Lys Gly Lys Phe Tyr Arg Cys Thr Asp Glu Ala 
1035 1040 1045 

AAA AGT AAC CCT GAA GAA TGC AGG GGA CTT TTC ATC CTC TAC AAG GAT 3702 
Lys Ser Asn Pro Glu Glu Cys Arg Gly Leu Phe lie Leu Tyr Lys Asp 
1050 1055 1060 

GGG GAT GTT GAC AGT CCT GTG GTC CGT GAA CGG ATC TGG CAA AAC AGT 3750 
Gly Asp Val Asp Ser Pro Val Val Arg Glu Arg He Trp Gin Asn Ser 
1065 1070 1075 1080 

GAT TTC AAC TTC GAC AAC GTC CTC TCT GCT ATG ATG GCG CTC TTC ACA 3 798 

Asp Phe Asn Phe Asp Asn Val Leu Ser Ala Met Met Ala Leu Phe Thr 
1085 1090 1095 

GTC TCC ACG TTT GAG GGC TGG CCT GCG TTG CTG TAT AAA GCC ATC GAC 3846 
Val Ser Thr Phe Glu Gly Trp Pro Ala Leu Leu Tyr Lys Ala He Asp 
1100 1105 1110 

TCG AAT GGA GAG AAC ATC GGC CCA ATC TAC AAC CAC CGC GTG GAG ATC 3894 
Ser Asn Gly Glu Asn lie Gly Pro He Tyr Asn His Arg Val Glu He 
1115 1120 1125 

TCC ATC TTC TTC ATC ATC TAC ATC ATC ATT GTA GCT TTC TTC ATG ATG 3942 
Ser He Phe Phe He He Tyr He He He Val Ala Phe Phe Met Met 
1130 1135 1140 

AAC ATC TTT GTG GGC TTT GTC ATC GTT ACA TTT CAG GAA CAA GGA GAA 3990 
Asn He Phe Val Gly Phe Val He Val Thr Phe Gin Glu Gin Gly Glu 
1145 1150 1155 1160 

AAA GAG TAT AAG AAC TGT GAG CTG GAC AAA AAT CAG CGT CAG TGT GTT 4038 
Lys Glu Tyr Lys Asn Cys Glu Leu Asp Lys Asn Gin Arg Gin Cys Val 
1165 1170 1175 

GAA TAC GCC TTG AAA GCA CGT CCC TTG CGG AGA TAC ATC CCC AAA AAC 4086 
Glu Tyr Ala Leu Lys Ala Arg Pro Leu Arg Arg Tyr He Pro Lys Asn 
1180 1185 1190 

CCC TAC CAG TAC AAG TTC TGG TAC GTG GTG AAC TCT TCG CCT TTC GAA 4134 
Pro Tyr Gin Tyr Lys Phe Trp Tyr Val Val Asn Ser Ser Pro Phe Glu 
1195 1200 1205 

TAC ATG ATG TTT GTC CTC ATC ATG CTC AAC ACA CTC TGC TTG GCC ATG 4182 
Tyr Met Met Phe Val Leu He Met Leu Asn Thr Leu Cys Leu Ala Met 
1210 1215 1220 

CAG CAC TAC GAG CAG TCC AAG ATG TTC AAT GAT GCC ATG GAC ATT CTG 4230 
Gin His Tyr Glu Gin Ser Lys Met Phe Asn Asp Ala Met Asp He, Leu 
1225 1230 1235 1240 

AAC ATG GTC TTC ACC GGG GTG TTC ACC GTC GAG ATG GTT TTG AAA GTC 4278 
Asn Met Val Phe Thr Gly Val Phe Thr Val Glu Met Val Leu Lys Val 
1245 1250 1255 
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ATC GCA TTT AAG CCT AAG GGG TAT TTT AGT GAC GCC TGG AAC ACG TTT 
lie Ala Phe Lys Pro Lys Gly Tyr Phe Ser Asp Ala Trp Asn Thr Phe 
1260 1265 1270 

GAC TCC CTC ATC GTA ATC GGC AGC ATT ATA GAC GTG GCC CTC AGC GAA 
Asp Ser Leu lie Val He Gly Ser He He Asp Val Ala Leu Ser Glu 
1275 1280 1285 

GCA GAC CCA ACT GAA AGT GAA AAT GTC CCT GTC CCA ACT GCT ACA CCT 
Ala Asp Pro Thr Glu Ser Glu Asn Val Pro Val Pro Thr Ala Thr Pro 
1290 1295 1300 

GGG AAC TCT GAA GAG AGC AAT AGA ATC TCC ATC ACC TTT TTC CGT CTT 
Gly Asn Ser Glu Glu Ser Asn Arg He Ser He Thr Phe Phe Arg Leu 
1305 1310 1315 1320 

TTC CGA GTG ATG CGA TTG GTG AAG CTT CTC AGC AGG GGG GAA GGC ATC 
Phe Arg Val Met Arg Leu Val Lys Leu Leu Ser Arg Gly Glu Gly He 
1325 1330 1335 

CGG ACA TTG CTG TGG ACT TTT ATT AAG TTC TTT CAG GCG CTC CCG TAT 
Arg Thr Leu Leu Trp Thr Phe He Lys Phe Phe Gin Ala Leu Pro Tyr 

134fl n<r J 



GTG GCC CTC CTC ATA GCC ATG CTG TTC TTC ATC TAT GCG GTC ATT GGC 
Val Ala Leu Leu He Ala Met Leu Phe Phe He Tyr Ala Val He Glv 
1355 1360 1365 

ATG CAG ATG TTT GGG AAA GTT GCC ATG AGA GAT AAC AAC CAG ATC AAT 
Met Gin Met Phe Gly Lys Val Ala Met Arg Asp Asn Asn Gin He Asn 
1370 1375 13B0 

AGG AAC AAT AAC TTC CAG ACG TTT CCC CAG GCG GTG CTG CTG CTC TTC 
Arg Asn Asn Asn Phe Gin Thr Phe Pro Gin Ala Val Leu Leu Leu Phe 
I 385 1390 1395 1400 

AGG TGT GCA ACA GGT GAG GCC TGG CAG GAG ATC ATG CTG GCC TGT CTC 
Arg Cys Ala Thr Gly Glu Ala Trp Gin Glu He Met Leu Ala Cys Leu 
1405 1410 1415 

CCA GGG AAG CTC TGT GAC CCT GAG TCA GAT TAC AAC CCC GGG GAG GAG 
Pro Gly Lys Leu Cys Asp Pro Glu Ser Asp Tyr Asn Pro Gly Glu Glu 
1420 1425 1430 

CAT ACA TGT GGG AGC AAC TTT GCC ATT GTC TAT TTC ATC AGT TTT TAC 
Hxs Thr Cys Gly Ser Asn Phe Ala He Val Tyr Phe He Ser Phe Tyr 
1435 1440 1445 

ATG CTC TGT GCA TTT CTG ATC ATC AAT CTG TTT GTG GCT GTC ATC ATG 
Met Leu Cys Ala Phe Leu He He Asn Leu Phe Val Ala Val He Met 
1450 1455 1460 

GAT AAT TTC GAC TAT CTG ACC CGG GAC TGG TCT ATT TTG GGG CCT CAC 
Asp Asn Phe Asp Tyr Leu Thr Arg Asp Trp Ser He Leu Gly Pro His 
1465 1470 1475 1480 
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CAT TTA GAT GAA TTC AAA AGA ATA TGG TCA GAA TAT GAC CCT GAG GCA 
His Leu Asp Glu Phe Lys Arg He Trp Ser Glu Tyr Asp Pro Glu Ala 
1485 1490 1495 

AAG GGA AGG ATA AAA CAC CTT GAT GTG GTC ACT CTG CTT CGA CGC ATC 
Lys Gly Arg He Lys His Leu Asp Val Val Thr Leu Leu Arg Arg He 
1500 1505 1510 

CAG CCT CCC CTG GGG TTT GGG AAG TTA TGT CCA CAC AGG GTA GCG TGC 
Gin Pro Pro Leu Gly Phe Gly Lys Leu Cys Pro His Arg Val Ala Cys 
1515 1S20 1525 

AAG AGA TTA GTT GCC ATG AAC ATG CCT CTC AAC AGT GAC GGG ACA GTC 
Lys Arg Leu Val Ala Met Asn Met Pro Leu Asn Ser Asp Gly Thr Val 
1530 1535 1540 

ATG TTT AAT GCA ACC CTG TTT GCT TTG GTT CGA ACG GCT CTT AAG ATC 
Met Phe Asn Ala Thr Leu Phe Ala Leu Val Arg Thr Ala Leu Lys He 
1545 1550 1555 1560 

AAG ACC GAA GGG AAC CTG GAG CAA GCT AAT GAA GAA CTT CGG GCT GTG 
Lys Thr Glu Gly Asn Leu Glu Gin Ala Asn Glu Glu Leu Arg Ala Val 
1565 1570 1575 

ATA AAG AAA ATT TGG AAG AAA ACC AGC ATG AAA TTA CTT GAC CAA GTT 
He Lys Lys He Trp Lys Lys Thr Ser Met Lys Leu Leu Asp Gin Val 
1580 1585 1590 

GTC CCT CCA GCT GGT GAT GAT GAG GTA ACC GTG GGG AAG TTC TAT GCC 
Val Pro Pro Ala Gly Asp Asp Glu Val Thr Val Gly Lys Phe Tyr Ala 
1595 1600 1605 

ACT TTC CTG ATA CAG GAC TAC TTT AGG AAA TTC AAG AAA CGG AAA GAA 
Thr Phe Leu He Gin Asp Tyr Phe Arg Lys Phe Lys Lys Arg Lys Glu 
1610 1615 1620 

CAA GGA CTG GTG GGA AAG TAC CCT GCG AAG AAC ACC ACA ATT GCC CTA 
Gin Gly Leu Val Gly Lys Tyr Pro Ala Lys Asn Thr Thr He Ala Leu 
1625 1630 1635 1640 

CAG GCG GGA TTA AGG ACA CTG CAT GAC ATT GGG CCA GAA ATC CGG CGT 
Gin Ala Gly Leu Arg Thr Leu His Asp He Gly Pro Glu He Arg Arg 
1645 1650 1655 

GCT ATA TCG TGT GAT TTG CAA GAT GAC GAG CCT GAG GAA ACA AAA CGA 
Ala He Ser Cys Asp Leu Gin Asp Asp Glu Pro Glu Glu Thr Lys Arg 
1660 1665 1670 

GAA GAA GAA GAT GAT GTG TTC AAA AGA AAT GGT GCC CTG CTT GGA AAC 
Glu Glu Glu Asp Asp Val Phe Lys Arg Asn Gly Ala Leu Leu Gly Asn 
1675 1680 1685 

CAT GTC AAT CAT GTT AAT AGT GAT AGG AGA GAT TCC CTT CAG CAG ACC 
His Val Asn His Val Asn Ser Asp Arg Arg Asp Ser Leu Gin Gin Thr 
1690 1695 1700 
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Thr ™ ^ GT CCC CTG ^ GTC °* AGG CCT TCA ATT CCA CCT 

Asn Thr Thr His Arg Pro Leu His Val Gin Arg Pro Ser lie Pro Pro 
1705 1710 1715 1720 

GCA AGT GAT ACT GAG AAA CCG CTG TTT CCT CCA GCA GGA AAT TCG GTG 
Ala Ser Asp Thr Glu Lys Pro Leu Phe Pro Pro Ala Gly Asn Ser Val 
1725 1730 i 735 

TGT CAT AAC CAT CAT AAC CAT AAT TCC ATA GGA AAG CAA GTT CCC ACC 
Cys His Asn His His Asn His Asn Ser lie Gly Lys Sal Pro ££r 

1740 1745 1750 

sSr" ?S f T r CTC *** GCC ^ ATG TCC AAA G CT ^CC CAT 

Ser Thr Asn Ala Asn Leu Asn Asn Ala Asn Met Ser Lys Ala Ala His 
1755 1760 1765 

2?5 ft* CGG £ CC o GC A ? T GGG AAC CTT GAG CAT GTG TCT GAA AAT GGG 
Gly Lys Arg Pro Ser He Gly Asn Leu Glu His Val Ser Glu Asn Gly 
1770 1775 1780 

SI Ss IS « CC 5 AC H AT GAC G GG GAG CCT CAG AGA AGG TCC AGT 

His His Ser Ser His Lys His Asp Arg Glu Pro Gin Arg Arg Ser Ser 
1785 1790 1795 1800 

GTG AAA AGA ACC CGC TAT TAT GAA ACT TAC ATT AGG TCC GAC TCA GGA 
Val Lys Arg Thr Arg Tyr Tyr Glu Thr Tyr He Arg Ser Asp Ser Gly 
1805 leio " 1815 

So Glu ^ t CTC ACT A T T TGC CGG GAA GAC CCA GAG ATA CAT GGC 

Asp Glu Gin Leu Pro Thr He Cys Arg Glu Asp Pro Glu He His Gly 
I 820 1825 1830 

TAT TTC AGG GAC CCC CAC TGC TTG GGG GAG CAG GAG TAT TTC AGT AGT 
Tyr Phe Arg Asp Pro His Cys Leu Gly Glu Gin Glu Tyr Phe Ser Ser 
1 83S 1840 1845 

GAG GAA TGC TAC GAG GAT GAC AGC TCG CCC ACC TGG AGC AGG CAA AAC 
Glu Glu Cys Tyr Glu Asp Asp Ser Ser Pro Thr Trp Ser Arg Gin Asn 
1850 1855 i860 

TAT GGC TAC TAC AGC AGA TAC CCA GGC AGA AAC ATC GAC TCT GAG AGG 
Tyr Gly Tyr Tyr Ser Arg Tyr Pro Gly Arg Asn He Asp Ser Glu Arg 
1865 1870 1875 1880 

CCC CGA GGC TAC CAT CAT CCC CAA GGA TTC TTG GAG GAC GAT GAC TCG 
Pro Arg Gly Tyr His His Pro Gin Gly Phe Leu Glu Asp Asp Asp Ser 
1885 1890 1895 

CCC GTT TGC TAT GAT TCA CGG AGA TCT CCA AGG AGA CGC CTA CTA CCT 
Pro Val Cys Tyr Asp Ser Arg Arg Ser Pro Arg Arg Arg Leu Leu Pro 
1900 1905 1910 
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CGC CGG CAG AGC AGC CAG GAA GAG GTC CCG TCG TCT CCC ATC TTC CCC 6342 
Arg Arg Gin Ser Ser Gin Glu Glu Val Pro Ser Ser Pro He Phe Pro 
1930 1935 1940 

CAT CGC ACG GCC CTG CCT CTG CAT CTA ATG CAG CAA CAG ATC ATG GCA 63 90 

His Arg Thr Ala Leu Pro Leu His Leu Met Gin Gin Gin He Met Ala 
1945 1950 1955 1960 

GTT GCC GGC CTA GAT TCA AGT AAA GCC CAG AAG TAC TCA CCG AGT CAC 6438 
Val Ala Gly Leu Asp Ser Ser Lys Ala Gin Lys Tyr Ser Pro Ser His 
1965 1970 1975 

TCG ACC CGG TCG TGG GCC ACC CCT CCA GCA ACC CCT CCC TAC CGG GAC 6486 
Ser Thr Arg Ser Trp Ala Thr Pro Pro Ala Thr Pro Pro Tyr Arg Asp 
1980 1985 1990 

TGG ACA CCG TGC TAC ACC CCC CTG ATC CAA GTG GAG CAG TCA GAG GCC 6534 
Trp Thr Pro Cys Tyr Thr Pro Leu He Gin Val Glu Gin Ser Glu Ala 
1995 2000 2005 

CTG GAC CAG GTG AAC GGC AGC CTG CCG TCC CTG CAC CGC AGC TCC TGG 6582 
Leu Asp Gin Val Asn Gly Ser Leu Pro Ser Leu His Arg Ser Ser Trp 
2010 2015 2020 

TAC ACA GAC GAG CCC GAC ATC TCC TAC CGG ACT TTC ACA CCA GCC AGC 6630 
Tyr Thr Asp Glu Pro Asp He Ser Tyr Arg Thr Phe Thr Pro Ala Ser 
2025 2030 2035 2040 

CTG ACT GTC CCC AGC AGC TTC CGG AAC AAA AAC AGC GAC AAG CAG AGG 6678 
Leu Thr Val Pro Ser Ser Phe Arg Asn Lys Asn Ser Asp Lys Gin Arg 
2045 2050 2055 

AGT GCG GAC AGC TTG GTG GAG GCA GTC CTG ATA TCC GAA GGC TTG GGA 672 6 

Ser Ala Asp Ser Leu Val Glu Ala Val Leu He Ser Glu Gly Leu Gly 
2060 2065 2070 

CGC TAT GCA AGG GAC CCA AAA TTT GTG TCA GCA ACA AAA CAC GAA ATC 6774 
Arg Tyr Ala Arg Asp Pro Lys Phe Val Ser Ala Thr Lys His Glu He 
2075 2080 2085 

GCT GAT GCC TGT GAC CTC ACC ATC GAC GAG ATG GAG AGT GCA GCC AGC 6 822 

Ala Asp Ala Cys Asp Leu Thr He Asp Glu Met Glu Ser Ala Ala Ser 
2090 2095 2100 

ACC CTG CTT AAT GGG AAC GTG CGT CCC CGA GCC AAC GGG GAT GTG GGC 6870 
Thr Leu Leu Asn Gly Asn Val Arg Pro Arg Ala Asn Gly Asp Val Gly 
2105 2110 2115 2120 

CCC CTC TCA CAC CGG CAG GAC TAT GAG CTA CAG GAC TTT GGT CCT GGC 6 918 

Pro Leu Ser His Arg Gin Asp Tyr Glu Leu Gin Asp Phe Gly Pro Gly 
2125 2130 2135 

TAC AGC GAC GAA GAG CCA GAC CCT GGG AGG GAT GAG GAG GAC CTG GCG 6966 
Tyr Ser Asp Glu Glu Pro Asp Pro Gly Arg Asp Glu Glu Asp Leu Ala 
2140 2145 2150 
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GAT GAA ATG ATA TGC ATC ACC ACC TTG TAGCCCCCAG CGAGGGGCAG 7013 
Asp Glu Met lie Cys lie Thr Thr Leu 
2155 2160 

ACTGGCTCTG GCCTCAGGTG GGGCGCAGGA GAGCCAGGGG AAAAGTGCCT CATAGTTAGG 7073 

AAAGTTTAGG CACTAGTTGG GAGTAATATT CAATTAATTA GACTTTTGTA TAAGAGATGT 7133 

CATGCCTCAA GAAAGCCATA AACCTGGTAG GAACAGGTCC CAAGCGGTTG AGCCTGGCAG 7193 

AGTACCATGC GCTCGGCCCC AGCTGCAGGA AACAGCAGGC CCCGCCCTCT CACAGAGGAT 7253 

GGGTGAGGAG GCCAGACCTG CCCTGCCCCA TTGTCCAGAT GGGCACTGCT GTGGAGTCTG 7313 

CTTCTCCCAT GTACCAGGGC ACCAGGCCCA CCCAACTGAA GGCATGGCGG CGGGGTGCAG 7373 

GGGAAAGTTA AAGGTGATGA CGATCATCAC ACCTGTGTCG TTACCTCAGC CATCGGTCTA 7433 

GCATATCAGT CACTGGGCCC AACATATCCA TTTTTAAACC CTTTCCCCCA AATACACTGC 7493 

GTCCTGGTTC CTGTTTAGCT GTTCTGAAAT ACGGTGTGTA AGTAAGTCAG AACCCAGCTA 7553 

CCAGTGATTA TTGCGAGGGC AATGGGACCT CATAAATAAG GTTTTCTGTG ATGTGACGCC 7S13 

AGTTTACATA AGAGAATATC AC 7635 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..102 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1 . . 104 

(D) OTHER INFORMATION: /note= "A 104 -nucleotide 
alternative exon of alpha -ID. " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

GTA AAT GAT GCG ATA GGA TGG GAA TGG CCA TGG GTG TAT TTT GTT AGT 4 8 

Val Asn Asp Ala lie Gly Trp Glu Trp Pro Trp Val Tyr Phe Val Ser 
1 5 10 15 

CTG ATC ATC CTT GGC TCA TTT TTC GTC CTT AAC CTG GTT CTT GGT GTC 96 
Leu lie lie Leu Gly Ser Phe Phe Val Leu Asn Leu Val Leu Gly Val 
20 25 30 



WO 95/04822 



PCT/US94/09230 



-127- 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6575 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..6492 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

ATG GTC AAT GAG AAT ACG AGG ATG TAC ATT CCA GAG GAA AAC CAC CAA 
Met Val Asn Glu Asn Thr Arg Met Tyr lie Pro Glu Glu Asn His Gin 
1 5 10 15 

GGT TCC AAC TAT GGG AGC CCA CGC CCC GCC CAT GCC AAC ATG AAT GCC 
Gly Ser Asn Tyr Gly Ser Pro Arg Pro Ala His Ala Asn Met Asn Ala 
20 25 30 

AAT GCG GCA GCG GGG CTG GCC CCT GAG CAC ATC CCC ACC CCG GGG GCT 
Asn Ala Ala Ala Gly Leu Ala Pro Glu His He Pro Thr Pro Gly Ala 
35 40 45 

GCC CTG TCG TGG CAG GCG GCC ATC GAC GCA GCC CGG CAG GCT AAG CTG 
Ala Leu Ser Trp Gin Ala Ala He Asp Ala Ala Arg Gin Ala Lys Leu 
50 55 60 

ATG GGC AGC GCT GGC AAT GCG ACC ATC TCC ACA GTC AGC TCC ACG CAG 
Met Gly Ser Ala Gly Asn Ala Thr He Ser Thr Val Ser Ser Thr Gin 
65 70 75 80 

CGG AAG CGC CAG CAA TAT GGG AAA CCC AAG AAG CAG GGC AGC ACC ACG 
Arg Lys Arg Gin Gin Tyr Gly Lys Pro Lys Lys Gin Gly Ser Thr Thr 
85 90 95 

GCC ACA CGC CCG CCC CGA GCC CTG CTC TGC CTG ACC CTG AAG AAC CCC 
Ala Thr Arg Pro Pro Arg Ala Leu Leu Cys Leu Thr Leu Lys Asn Pro 
100 105 110 

ATC CGG AGG GCC TGC ATC AGC ATT GTC GAA TGG AAA CCA TTT GAA ATA 
He Arg Arg Ala Cys He Ser He Val Glu Trp Lys Pro Phe Glu He 
115 120 125 

ATT ATT TTA CTG ACT ATT TTT GCC AAT TGT GTG GCC TTA GCG ATC TAT 
He He Leu Leu Thr He Phe Ala Asn Cys Val Ala Leu Ala He Tyr 
130 135 140 

ATT CCC TTT CCA GAA GAT GAT TCC AAC GCC ACC AAT TCC AAC CTG GAA 
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lie Pro Phe Pro Glu Asp Asp Ser Asn Ala Thr Asn Ser Asn Leu Glu 
145 150 155 iGO 

CGA GTG GAA TAT CTC TTT CTC ATA ATT TTT ACG GTG GAA GCG TTT TTA 
Arg Val Glu Tyr Leu Phe Leu lie lie Phe Thr Val Glu Ala Phe Leu 
165 170 175 

AAA GTA ATC GCC TAT GGA CTC CTC TTT CAC CCC AAT GCC TAC CTC CGC 
Lys Val lie Ala Tyr Gly Leu Leu Phe His Pro Asn Ala Tyr Leu Arq 
180 185 190 

AAC GGC TGG AAC CTA CTA GAT TTT ATA ATT GTG GTT GTG GGG CTT TTT 
Asn Gly Trp Asn Leu Leu Asp Phe He He Val Val Val Gly Leu Phe 
195 200 205 

AGT GCA ATT TTA GAA CAA GCA ACC AAA GCA GAT GGG GCA AAC GCT CTC 
Ser W^f Ile LeU Glu Gln Ala Thr L V S Ala Asp G1 y Ala Asn Ala Leu 
210 215 220 

GGA GGG AAA GGG GCC GGA TTT GAT GTG AAG GCG CTG AGG GCC TTC CGC 
Gly Gly Lys Gly Ala Gly Phe Asp Val Lys Ala Leu Arg Ala Phe Arg 
225 230 235 240 

GTG CTG CGC CCC CTG CGG CTG GTG TCC GGA GTC CCA AGT CTC CAG GTG 
Val Leu Arg Pro Leu Arg Leu Val Ser Gly Val Pro Ser Leu Gin Val 
245 250 255 

GTC CTG AAT TCC ATC ATC AAG GCC ATG GTC CCC CTG CTG CAC ATC GCC 
Val Leu Asn Ser Ile Ile Lys Ala Met Val Pro Leu Leu His Ile Ala 
260 265 270 

CTG CTT GTG CTG TTT GTC ATC ATC ATC TAC GCC ATC ATC GGC TTG GAG 
Leu Leu Val Leu Phe Val Ile Ile Ile Tyr Ala Ile lie Gly Leu Glu 
275 280 285 

CTC TTC ATG GGG AAG ATG CAC AAG ACC TGC TAC AAC CAG GAG GGC ATA 
Leu Phe Met Gly Lys Met His Lys Thr Cys Tyr Asn Gin Glu Gly Ile 
290 295 300 

GCA GAT GTT CCA GCA GAA GAT GAC CCT TCC CCT TGT GCG CTG GAA ACG 
Ala Asp Val Pro Ala Glu Asp Asp Pro Ser Pro Cys Ala Leu Glu Thr 
305 310 315 320 

GGC CAC GGG CGG CAG TGC CAG AAC GGC ACG GTG TGC AAG CCC GGC TGG 
Gly His Gly Arg Gin Cys Gin Asn Gly Thr Val Cys Lys Pro Gly Trp 
325 330 335 

GAT GGT CCC AAG CAC GGC ATC ACC AAC TTT GAC AAC TTT GCC TTC GCC 
Asp Gly Pro Lys His Gly Ile Thr Asn Phe Asp Asn Phe Ala Phe Ala 
340 345 350 

ATG CTC ACG GTG TTC CAG TGC ATC ACC ATG GAG GGC TGG ACG GAC GTG 
Met Leu Thr Val Phe Gin Cys Ile Thr Met Glu Gly Trp Thr Asp Val 
355 360 365 

CTG TAC TGG GTC AAT GAT GCC GTA GGA AGG GAC TGG CCC TGG ATC TAT 
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Leu Tyr Trp Val Asn Asp Ala Val Gly Arg Asp Trp Pro Trp lie Tyr 
370 375 380 

TTT GTT ACA CTA ATC ATC ATA GGG TCA TTT TTT GTA CTT AAC TTG GTT 1200 
Phe Val Thr Leu lie He He Gly Ser Phe Phe Val Leu Asn Leu Val 
385 390 395 400 

CTC GGT GTG CTT AGC GGA GAG TTT TCC AAA GAG AGG GAG AAG GCC AAG 1248 
Leu Gly Val Leu Ser Gly Glu Phe Ser Lys Glu Arg Glu Lys Ala Lys 
405 410 415 

GCC CGG GGA GAT TTC CAG AAG CTG CGG GAG AAG CAG CAG CTA GAA GAG 1296 
Ala Arg Gly Asp Phe Gin Lys Leu Arg Glu Lys Gin Gin Leu Glu Glu 
420 425 430 

GAT CTC AAA GGC TAC CTG GAT TGG ATC ACT CAG GCC GAA GAC ATC GAT 1344 
Asp Leu Lys Gly Tyr Leu Asp Trp He Thr Gin Ala Glu Asp He Asp 
435 440 445 

CCT GAG AAT GAG GAC GAA GGC ATG GAT GAG GAG AAG CCC CGA AAC AGA 1392 
Pro Glu Asn Glu Asp Glu Gly Met Asp Glu Glu Lys Pro Arg Asn Arg 
450 455 460 

GGC ACT CCG GCG GGC ATG CTT GAT CAG AAG AAA GGG AAG TTT GCT TGG 1440 
Gly Thr Pro Ala Gly Met Leu Asp Gin Lys Lys Gly Lys Phe Ala Trp 
465 470 475 480 

TTT AGT CAC TCC ACA GAA ACC CAT GTG AGC ATG CCC ACC AGT GAG ACC 1488 
Phe Ser His Ser Thr Glu Thr His Val Ser Met Pro Thr Ser Glu Thr 
485 490 495 

GAG TCC GTC AAC ACC GAA AAC GTG GCT GGA GGT GAC ATC GAG GGA GAA 1536 
Glu Ser Val Asn Thr Glu Asn Val Ala Gly Gly Asp He Glu Gly Glu 
500 505 510 

AAC TGC GGG GCC AGG CTG GCC CAC CGG ATC TCC AAG TCA AAG TTC AGC 1584 
Asn Cys Gly Ala Arg Leu Ala His Arg He Ser Lys Ser Lys Phe Ser 
515 520 525 

CGC TAC TGG CGC CGG TGG AAT CGG TTC TGC AGA AGG AAG TGC CGC GCC 1632 
Arg Tyr Trp Arg Arg Trp Asn Arg Phe Cys Arg Arg Lys Cys Arg Ala 
530 535 540 

GCA GTC AAG TCT AAT GTC TTC TAC TGG CTG GTG ATT TTC CTG GTG TTC 1680 
Ala Val Lys Ser Asn Val Phe Tyr Trp Leu Val He Phe Leu Val Phe 
545 550 555 560 

CTC AAC ACG CTC ACC ATT GCC TCT GAG CAC TAC AAC CAG CCC AAC TGG 1728 
Leu Asn Thr Leu Thr He Ala Ser Glu His Tyr Asn Gin Pro Asn Trp 
565 570 575 

CTC ACA GAA GTC CAA GAC ACG GCA AAC AAG GCC CTG CTG GCC CTG TTC 1776 
Leu Thr Glu Val Gin Asp Thr Ala Asn Lys Ala Leu Leu Ala Leu Phe 
580 585 590 

ACG GCA GAG ATG CTC CTG AAG ATG TAC AGC CTG GGC CTG CAG GCC TAC 1824 
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Thr Ala Glu Met Leu Leu Lys Met Tyr Ser Leu Gly Leu Gin Ala Tyr 
595 600 605 

TTC GTG TCC CTC TTC AAC CGC TTT GAC TGC TTC GTC GTG TGT GGC GGC 
Phe Val Ser Leu Phe Asn Arg Phe Asp Cys Phe Val Val Cys Gly Gly 
610 615 * 620 

ATC CTG GAG ACC ATC CTG GTG GAG ACC AAG ATC ATG TCC CCA CTG GGC 
lie Leu Glu Thr lie Leu Val Glu Thr Lys He Met Ser Pro Leu Gly 
62 5 630 635 640 

ATC TCC GTG CTC AGA TGC GTC CGG CTG CTG AGG ATT TTC AAG ATC ACG 
He Ser Val Leu Arg Cys Val Arg Leu Leu Arg He Phe Lys He Thr 
645 650 655 

AGG TAC TGG AAC TCC TTG AGC AAC CTG GTG GCA TCC TTG CTG AAC TCT 
Arg Tyr Trp Asn Ser Leu Ser Asn Leu Val Ala Ser Leu Leu Asn Ser 
660 665 670 

GTG CGC TCC ATC GCC TCC CTG CTC CTT CTC CTC TTC CTC TTC ATC ATC 
Val Arg Ser He Ala Ser Leu Leu Leu Leu Leu Phe Leu Phe He He 
675 680 685 

ATC TTC TCC CTC CTG GGG ATG CAG CTC TTT GGA GGA AAG TTC AAC TTT 
He Phe Ser Leu Leu Gly Met Gin Leu Phe Gly Gly Lys Phe Asn Phe 
690 695 700 

GAT GAG ATG CAG ACC CGG AGG AGC ACA TTC GAT AAC TTC CCC CAG TCC 
Asp Glu Met Gin Thr Arg Arg Ser Thr Phe Asp Asn Phe Pro Gin Ser 
705 710 715 720 

CTC CTC ACT GTG TTT CAG ATC CTG ACC GGG GAG GAC TGG AAT TCG GTG 
Leu Leu Thr Val Phe Gin He Leu Thr Gly Glu Asp Trp Asn Ser Val 
725 730 735 

ATG TAT GAT GGG ATC ATG GCT TAT GGG GGC CCC TCT TTT CCA GGG ATG 
Met Tyr Asp Gly He Met Ala Tyr Gly Gly Pro Ser Phe Pro Gly Met 
740 745 750 

TTA GTC TGT ATT TAC TTC ATC ATC CTC TTC ATC TGT GGA AAC TAT ATC 
Leu Val Cys He Tyr Phe He He Leu Phe He Cys Gly Asn Tyr He 
755 760 765 

CTA CTG AAT GTG TTC TTG GCC ATT GCT GTG GAC AAC CTG GCT GAT GCT 
Leu Leu Asn Val Phe Leu Ala He Ala Val Asp Asn Leu Ala Asp Ala 
770 775 780 

GAG AGC CTC ACA TCT GCC CAA AAG GAG GAG GAA GAG GAG AAG GAG AGA 
Glu Ser Leu Thr Ser Ala Gin Lys Glu Glu Glu Glu Glu Lys Glu Arg 
785 790 795 800 

AAG AAG CTG GCC AGG ACT GCC AGC CCA GAG AAG AAA CAA GAG TTG GTG 
Lys Lys Leu Ala Arg Thr Ala Ser Pro Glu Lys Lys Gin Glu Leu Val 
805 810 815 

GAG AAG CCG GCA GTG GGG GAA TCC AAG GAG GAG AAG ATT GAG CTG AAA 
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Glu Lys Pro Ala Val Gly Glu Ser Lys Glu Glu Lys lie Glu Leu Lys 
820 825 830 

TCC ATC ACG GCT GAC GGA GAG TCT CCA CCC GCC ACC AAG ATC AAC ATG 2544 
Ser lie Thr Ala Asp Gly Glu Ser Pro Pro Ala Thr Lys lie Asn Met 
835 840 845 

GAT GAC CTC CAG CCC AAT GAA AAT GAG GAT AAG AGC CCC TAC CCC AAC 2592 
Asp Asp Leu Gin Pro Asn Glu Asn Glu Asp Lys Ser Pro Tyr Pro Asn 
850 855 860 

CCA GAA ACT ACA GGA GAA GAG GAT GAG GAG GAG CCA GAG ATG CCT GTC 2640 
Pro Glu Thr Thr Gly Glu Glu Asp Glu Glu Glu Pro Glu Met Pro Val 
865 870 875 880 

GGC CCT CGC CCA CGA CCA CTC TCT GAG CTT CAC CTT AAG GAA AAG GCA 2688 
Gly Pro Arg Pro Arg Pro Leu Ser Glu Leu His Leu Lys Glu Lys Ala 
885 890 895 

GTG CCC ATG CCA GAA GCC AGC GCG TTT TTC ATC TTC AGC TCT AAC AAC 2736 
Val Pro Met Pro Glu Ala Ser Ala Phe Phe lie Phe Ser Ser Asn Asn 
900 905 910 

AGG TTT CGC CTC CAG TGC CAC CGC ATT GTC AAT GAC ACG ATC TTC ACC 2784 
Arg Phe Arg Leu Gin Cys His Arg lie Val Asn Asp Thr He Phe Thr 
915 920 925 

AAC CTG ATC CTC TTC TTC ATT CTG CTC AGC AGC ATT TCC CTG GCT GCT 2832 
Asn Leu He Leu Phe Phe He Leu Leu Ser Ser He Ser Leu Ala Ala 
930 935 940 

GAG GAC CCG GTC CAG CAC ACC TCC TTC AGG AAC CAT ATT CTG TTT TAT 2880 
Glu Asp Pro Val Gin His Thr Ser Phe Arg Asn His He Leu Phe Tyr 
945 950 955 960 

TTT GAT ATT GTT TTT ACC ACC ATT TTC ACC ATT GAA ATT GCT CTG AAG 2928 
Phe Asp He Val Phe Thr Thr He Phe Thr He Glu He Ala Leu Lys 
965 970 975 

ATG ACT GCT TAT GGG GCT TTC TTG CAC AAG GGT TCT TTC TGC CGG AAC 2976 
Met Thr Ala Tyr Gly Ala Phe Leu His Lys Gly Ser Phe Cys Arg Asn 
980 * 985 " 990 

TAC TTC AAC ATC CTG GAC CTG CTG GTG GTC AGC GTG TCC CTC ATC TCC 3024 
Tyr Phe Asn He Leu Asp Leu Leu Val Val Ser Val Ser Leu He Ser 
995 1000 1005 

TTT GGC ATC CAG TCC AGT GCA ATC AAT GTC GTG AAG ATC TTG CGA GTC 3072 
Phe Gly He Gin Ser Ser Ala He Asn Val Val Lys He Leu Arg Val 
1010 1015 1020 

CTG CGA GTA CTC AGG CCC CTG AGG GCC ATC AAC AGG GCC AAG GGG CTA 3120 
Leu Arg Val Leu Arg Pro Leu Arg Ala He Asn Arg Ala Lys Gly Leu 
1025 1030 1035 1040 

AAG CAT GTG GTT CAG TGT GTG TTT GTC GCC ATC CGG ACC ATC GGG AAC 3168 
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Lys His Val Val Gin Cys Val Phe Val Ala He Arg Thr He Gly Asn 
1045 1050 " 1055 

ATC GTG ATT GTC ACC ACC CTG CTG CAG TTC ATG TTT GCC TGC ATC GGG 
He Val He Val Thr Thr Leu Leu Gin Phe Met Phe Ala Cys He Gly 
1060 1065 1070 

GTC CAG CTC TTC AAG GGA AAG CTG TAC ACC TGT TCA GAC AGT TCC AAG 
Val Gin Leu Phe Lys Gly Lys Leu Tyr Thr Cys Ser Asp Ser Ser Lys 
1075 1080 1085 

CAG ACA GAG GCG GAA TGC AAG GGC AAC TAC ATC ACG TAC AAA GAC GGG 
Gin Thr Glu Ala Glu Cys Lys Gly Asn Tyr He Thr Tyr Lys Asp Gly 
1090 1095 lioo 

GAG GTT GAC CAC CCC ATC ATC CAA CCC CGC AGC TGG GAG AAC AGC AAG 
Glu Val Asp His Pro He He Gin Pro Arg Ser Trp Glu Asn Ser Lys 
1105 1H0 H15 1120 

TTT GAC TTT GAC AAT GTT CTG GCA GCC ATG ATG GCC CTC TTC ACC GTC 
Phe Asp Phe Asp Asn Val Leu Ala Ala Met Met Ala Leu Phe Thr Val 
1125 H30 H35 

TCC ACC TTC GAA GGG TGG CCA GAG CTG CTG TAC CGC TCC ATC GAC TCC 
Ser Thr Phe Glu Gly Trp Pro Glu Leu Leu Tyr Arg Ser He Asp Ser 
1140 H45 1150 

CAC ACG GAA GAC AAG GGC CCC ATC TAC AAC TAC CGT GTG GAG ATC TCC 
Hxs Thr Glu Asp Lys Gly Pro He Tyr Asn Tyr Arg Val Glu He Ser 
1155 H60 H65 

ATC TTC TTC ATC ATC TAC ATC ATC ATC ATC GCC TTC TTC ATG ATG AAC 
He Phe Phe He lie Tyr He He He He Ala Phe Phe Met Met Asn 
1170 H75 HBO 

ATC TTC GTG GGC TTC GTC ATC GTC ACC TTT CAG GAG CAG GGG GAG CAG 
He Phe Val Gly Phe Val He Val Thr Phe Gin Glu Gin Gly Glu Gin 
1185 1190 H95 1200 

GAG TAC AAG AAC TGT GAG CTG GAC AAG AAC CAG CGA CAG TGC GTG GAA 
Glu Tyr Lys Asn Cys Glu Leu Asp Lys Asn Gin Arg Gin Cys Val Glu 
1205 1210 1215 

TAC GCC CTC AAG GCC CGG CCC CTG CGG AGG TAC ATC CCC AAG AAC CAG 
Tyr Ala Leu Lys Ala Arg Pro Leu Arg Arg Tyr He Pro Lys Asn Gin 
1220 1225 1230 

CAC CAG TAC AAA GTG TGG TAC GTG GTC AAC TCC ACC TAC TTC GAG TAC 
His Gin Tyr Lys Val Trp Tyr Val Val Asn Ser Thr Tyr Phe Glu Tyr 
1235 1240 1245 

CTG ATG TTC GTC CTC ATC CTG CTC AAC ACC ATC TGC CTG GCC ATG CAG 
Leu Met Phe Val Leu He Leu Leu Asn Thr He Cys Leu Ala Met Gin 
1250 1255 1260 

CAC TAC GGC CAG AGC TGC CTG TTC AAA ATC GCC ATG AAC ATC CTC AAC 
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His Tyr Gly Gin Ser Cys Leu Phe Lys lie Ala Met Asn He Leu Asn 
1265 1270 1275 1280 

ATG CTC TTC ACT GGC CTC TTC ACC GTG GAG ATG ATC CTG AAG CTC ATT 3888 
Met Leu Phe Thr Gly Leu Phe Thr Val Glu Met He Leu Lys Leu He 
1285 1290 1295 

GCC TTC AAA CCC AAG GGT TAC TTT AGT GAT CCC TGG AAT GTT TTT GAC 3 936 

Ala Phe Lys Pro Lys Gly Tyr Phe Ser Asp Pro Trp Asn Val Phe Asp 
1300 1305 1310 

TTC CTC ATC GTA ATT GGC AGC ATA ATT GAC GTC ATT CTC AGT GAG ACT 3984 
Phe Leu He Val He Gly Ser He He Asp Val He Leu Ser Glu Thr 
1315 1320 1325 

AAT CCA GCT GAA CAT ACC CAA TGC TCT CCC TCT ATG AAC GCA GAG GAA 4 032 

Asn Pro Ala Glu His Thr Gin Cys Ser Pro Ser Met Asn Ala Glu Glu 
1330 1335 1340 

AAC TCC CGC ATC TCC ATC ACC TTC TTC CGC CTG TTC CGG GTC ATG CGT 4 080 

Asn Ser Arg He Ser He Thr Phe Phe Arg Leu Phe Arg Val Met Arg 
1345 1350 1355 1360 

CTG GTG AAG CTG CTG AGC CGT GGG GAG GGC ATC CGG ACG CTG CTG TGG 4128 
Leu Val Lys Leu Leu Ser Arg Gly Glu Gly He Arg Thr Leu Leu Trp 
1365 1370 1375 

ACC TTC ATC AAG TCC TTC CAG GCC CTG CCC TAT GTG GCC CTC CTG ATC 4176 
Thr Phe He Lys Ser Phe Gin Ala Leu Pro Tyr Val Ala Leu Leu He 
1380 1385 1390 

GTG ATG CTG TTC TTC ATC TAC GCG GTG ATC GGG ATG CAG GTG TTT GGG 4224 
Val Met Leu Phe Phe He Tyr Ala Val He Gly Met Gin Val Phe Gly 
1395 1400 1405 

AAA ATT GCC CTG AAT GAT ACC ACA GAG ATC AAC CGG AAC AAC AAC TTT 4272 
Lys He Ala Leu Asn Asp Thr Thr Glu He Asn Arg Asn Asn Asn Phe 
1410 1415 1420 

CAG ACC TTC CCC CAG GCC GTG CTG CTC CTC TTC AGG TGT GCC ACC GGG 4320 
Gin Thr Phe Pro Gin Ala Val Leu Leu Leu Phe Arg Cys Ala Thr Gly 
1425 1430 1435 1440 

GAG GCC TGG CAG GAC ATC ATG CTG GCC TGC ATG CCA GGC AAG AAG TGT 4368 
Glu Ala Trp Gin Asp He Met Leu Ala Cys Met Pro Gly Lys Lys Cys 
1445 1450 1455 

GCC CCA GAG TCC GAG CCC AGC AAC AGC ACG GAG GGT GAA ACA CCC TGT 4416 
Ala Pro Glu Ser Glu Pro Ser Asn Ser Thr Glu Gly Glu Thr Pro Cys 
1460 1465 1470 

GGT AGC AGC TTT GCT GTC TTC TAC TTC ATC AGC TTC TAC ATG CTC TGT 4464 
Gly Ser Ser Phe Ala Val Phe Tyr Phe He Ser Phe Tyr Met Leu Cys 
1475 1480 1485 

GCC TTC CTG ATC ATC AAC CTC TTT GTA GCT GTC ATC ATG GAC AAC TTT 4512 
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f^ 0 LeU 116 I1S ASn Phe Val Ma Val Ile Me t Asp Asn Phe 

A * : ' u 1495 1500 

GAC TAC CTG ACA AGG GAC TGG TCC ATC CTT GGT CCC CAC CAC CTG GAT 
Asp Tyr Leu Thr Arg Asp Trp Ser He Leu Gly Pro His His Leu Asp 
1505 1510 X515 i 5 20 

GAG TTT AAA AGA ATC TGG GCA GAG TAT GAC CCT GAA GCC AAG GGT CGT 
Glu Phe Lys Arg Ile Trp Ala Glu Tyr Asp Pro Glu Ala Lys Gly Arg 
1525 1530 15 35 

ATC AAA CAC CTG GAT GTG GTG ACC CTC CTC CGG CGG ATT CAG CCG CCA 
He Lys His Leu Asp Val Val Thr Leu Leu Arg Arg He Gin Pro Pro 
1540 1545 1550 

Su Gil 57 ^ f° CTG TGC CCT ^ CGC GTG GCT TGC AAA CGC CTG 
Leu Gly Phe Gly Lys Leu Cys Pro His Arg Val Ala Cys Lys Arg Leu 
1535 1560 1565 

vll IS m TG o CT CTG ^ AGC GAC GGG ACA GTC ATG TTC AAT 

n Met Pr ° Leu *** Ser G1 y Th * Val Met Phe Asn 

i:>70 1575 1580 

Sff S CC CTG IP GCC CTG GTC AGG ACG GCC CTG AGG ATC AAA ACA GAA 
Ala Thr Leu Phe Ala Leu Val Arg Thr Ala Leu Arg lie Lys Thr Glu 
1585 1590 1595 1600 

nffr T CTA CAA GCC ** T GAG GAG CTG CGG GG <3 ATC ATC AAG AAG 

Gly Asn Leu Glu Gin Ala Asn Glu Glu Leu Arg Ala Ile lie Lys Lys 
1605 1610 1615 

ATC TGG AAG CGG ACC AGC ATG AAG CTG CTG GAC CAG GTG GTG CCC CCT 
lie Trp Lys Arg Thr Ser Met Lys Leu Leu Asp Gin Val Val Pro Pro 
1620 1625 1630 

GCA GGT GAT GAT GAG GTC ACC GTT GGC AAG TTC TAC GCC ACG TTC CTG 
Ala Gly Asp Asp Glu Val Thr Val Gly Lys Phe Tyr Ala Thr Phe Leu 
1635 1640 1645 

ATC CAG GAG TAC TTC CGG AAG TTC AAG AAG CGC AAA GAG CAG GGC CTT 
G ^« G1U ^ Phe Lys Phe Lys Lys Arg Lys Glu Gin Gly Leu 

1650 1655 1660 

GTG GGC AAG CCC TCC CAG AGG AAC GCG CTG TCT CTG CAG GCT GGC TTG 
Val Gly Lys Pro Ser Gin Arg Asn Ala Leu Ser Leu Gin Ala Gly Leu 
1665 1670 1675 1680 

CGC ACA CTG CAT GAC ATC GGG CCT GAG ATC CGA CGG GCC ATC TCT GGA 
Arg Thr Leu His Asp Ile Gly Pro Glu Ile Arg Arg Ala Ile Ser Gly 
1685 1690 1695 

GAT CTC ACC GCT GAG GAG GAG CTG GAC AAG GCC ATG AAG GAG GCT GTG 
Asp Leu Thr Ala Glu Glu Glu Leu Asp Lys Ala Met Lys Glu Ala Val 
1700 1705 1710 

TCC GCT GCT TCT GAA GAT GAC ATC TTC AGG AGG GCC GGT GGC CTG TTC 
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Ser Ala Ala Ser Glu Asp Asp He Phe Arg Arg Ala Gly Gly Leu Phe 
1715 1720 1725 

GGC AAC CAC GTC AGC TAC TAC CAA AGC GAC GGC CGG AGC GCC TTC CCC 5232 
Gly Asn His Val Ser Tyr Tyr Gin Ser Asp Gly Arg Ser Ala Phe Pro 
1730 1735 1740 

CAG ACC TTC ACC ACT CAG CGC CCG CTG CAC ATC AAC AAG GCG GGC AGC 528 0 

Gin Thr Phe Thr Thr Gin Arg Pro Leu His He Asn Lys Ala Gly Ser 
1745 1750 1755 1760 

AGC CAG GGC GAC ACT GAG TCG CCA TCC CAC GAG AAG CTG GTG GAC TCC 5328 
Ser Gin Gly Asp Thr Glu Ser Pro Ser His Glu Lys Leu Val Asp Ser 
1765 1770 1775 

ACC TTC ACC CCG AGC AGC TAC TCG TCC ACC GGC TCC AAC GCC AAC ATC 5376 
Thr Phe Thr Pro Ser Ser Tyr Ser Ser Thr Gly Ser Asn Ala Asn He 
1780 1785 1790 

AAC AAC GCC AAC AAC ACC GCC CTG GGT CGC CTC CCT CGC CCC GCC GGC 5424 
Asn Asn Ala Asn Asn Thr Ala Leu Gly Arg Leu Pro Arg Pro Ala Gly 
1795 1800 1805 

TAC CCC AGC ACA GTC AGC ACT GTG GAG GGC CAC GGG CCC CCC TTG TCC 5472 
Tyr Pro Ser Thr Val Ser Thr Val Glu Gly His Gly Pro Pro Leu Ser 
1810 1815 1820 

CCT GCC ATC CGG GTG CAG GAG GTG GCG TGG AAG CTC AGC TCC AAC AGG 5520 
Pro Ala He Arg Val Gin Glu Val Ala Trp Lys Leu Ser Ser Asn Arg 
1825 1830 1835 1840 

TGC CAC TCC CGG GAG AGC CAG GCA GCC ATG GCG CGT CAG GAG GAG ACG 5568 
Cys His Ser Arg Glu Ser Gin Ala Ala Met Ala Arg Gin Glu Glu Thr 
1845 1850 1855 

TCT CAG GAT GAG ACC TAT GAA GTG AAG ATG AAC CAT GAC ACG GAG GCC 5616 
Ser Gin Asp Glu Thr Tyr Glu Val Lys Met Asn His Asp Thr Glu Ala 
1860 1865 1870 

TGC AGT GAG CCC AGC CTG CTC TCC ACA GAG ATG CTC TCC TAC CAG GAT 5664 
Cys Ser Glu Pro Ser Leu Leu Ser Thr Glu Met Leu Ser Tyr Gin Asp 
1875 1880 1885 

GAC GAA AAT CGG CAA CTG ACG CTC CCA GAG GAG GAC AAG AGG GAC ATC 5712 
Asp Glu Asn Arg Gin Leu Thr Leu Pro Glu Glu Asp Lys Arg Asp He 
1890 1895 1900 

CGG CAA TCT CCG AAG AGG GGT TTC CTC CGC TCT GCC TCA CTA GGT CGA 5760 
Arg Gin Ser Pro Lys Arg Gly Phe Leu Arg Ser Ala Ser Leu Gly Arg 
1905 1910 1915 1920 

AGG GCC TCC TTC CAC CTG GAA TGT CTG AAG CGA CAG AAG GAC CGA GGG 5808 
Arg Ala Ser Phe His Leu Glu Cys Leu Lys Arg Gin Lys Asp Arg Gly 
1925 1930 1935 

GGA GAC ATC TCT CAG AAG ACA GTC CTG CCC TTG CAT CTG GTT CAT CAT 5856 



WO 95/04822 



PCT/US94/09230 



-136- 



Gly Asp lie Ser Gin Lys Thr Val Leu Pro Leu His Leu Val His His 
1940 1945 1950 

CAG GCA TTG GCA GTG GCA GGC CTG AGC CCC CTC CTC CAG AGA AGC CAT 
Gin Ala Leu Ala Val Ala Gly Leu Ser Pro Leu Leu Gin Arg Ser His 
1955 i960 1965 

TCC CCT GCC TCA TTC CCT AGG CCT TTT GCC ACC CCA CCA GCC ACA CCT 
Ser Pro Ala Ser Phe Pro Arg Pro Phe Ala Thr Pro Pro Ala Thr Pro 
1970 1975 1980 

GGC AGC CGA GGC TGG CCC CCA CAG CCC GTC CCC ACC CTG CGG CTT GAG 
Gly Ser Arg Gly Trp Pro Pro Gin Pro Val Pro Thr Leu Arg Leu Glu 
1985 1990 1995 2000 

GGG GTC GAG TCC AGT GAG AAA CTC AAC AGC AGC TTC CCA TCC ATC CAC 
Gly Val Glu Ser Ser Glu Lys Leu Asn Ser Ser Phe Pro Ser lie His 
2005 2010 2015 

TGC GGC TCC TGG GCT GAG ACC ACC CCC GGT GGC GGG GGC AGC AGC GCC 
Cys Gly Ser Trp Ala Glu Thr Thr Pro Gly Gly Gly Gly ser Ser Ala 
2020 2025 2030 

GCC CGG AGA GTC CGG CCC GTC TCC CTC ATG GTG CCC AGC CAG GCT GGG 
Ala Arg Arg Val Arg Pro Val Ser Leu Met Val Pro Ser Gin Ala Glv 
2035 2040 2045 

GCC CCA GGG AGG CAG TTC CAC GGC AGT GCC AGC AGC CTG GTG GAA GCG 
Ala Pro Gly Arg Gin Phe His Gly Ser Ala Ser Ser Leu Val Glu Ala 
2050 2055 2060 

GTC TTG ATT TCA GAA GGA CTG GGG CAG TTT GCT CAA GAT CCC AAG TTC 
Val Leu He Ser Glu Gly Leu Gly Gin Phe Ala Gin Asp Pro Lys Phe 
2070 2075 2080 2085 

ATC GAG GTC ACC ACC CAG GAG CTG GCC GAC GCC TGC GAC ATG ACC ATA 
He Glu Val Thr Thr Gin Glu Leu Ala Asp Ala Cys Asp Met Thr He 
2090 2095 2100 

GAG GAG ATG GAG AGC GCG GCC GAC AAC ATC CTC AGC GGG GGC GCC CCA 
Glu Glu Met Glu Ser Ala Ala Asp Asn He Leu Ser Gly Gly Ala Pro 
2105 2110 2115 

CAG AGC CCC AAT GGC GCC CTC TTA CCC TTT GTG AAC TGC AGG GAC GCG 
Gin Ser Pro Asn Gly Ala Leu Leu Pro Phe Val Asn Cys Arg Asp Ala 
2120 2125 2130 

GGG CAG GAC CGA GCC GGG GGC GAA GAG GAC GCG GGC TGT GTG CGC GCG 
Gly Gin Asp Arg Ala Gly Gly Glu Glu Asp Ala Gly Cys Val Arg Ala 
2135 2135 2140 

CGG GGT CGA CCG AGT GAG GAG GAG CTC CAG GAC AGC AGG GTC TAC GTC 
Arg Gly Arg Pro Ser Glu Glu Glu Leu Gin Asp Ser Arg Val Tyr Val 
2145 2150 2155 2160 



AGC AGC CTG TAGTGGGCGC TGCCAGATGC GGGCTTTTTT TTATTTGTTT CAATGTTCCT 653 9 
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AATGGGTTCG TTTCAGAAGT GCCTCACTGT TCTCGT 6575 
(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 133 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 
AGACCACGGC TTCCTCGAAT CTTGCGCGAA GCCGCCGGCCA TCGGAGGAG GGATTAATCC 60 
AGACCCGCCG GGGGGTGTTT TCACATTTCT TCCTCTTCGTG GCTGCTCCT CCTATTAAAA 120 
CCATTTTTGG TCC 133 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 89 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
CGCTGAGGGC CTTCCGCGTG CTGCGCCCCC TGCGGCTGGT GTCCGGAGTC CCAAGTCTCC 60 
AGGTGGTCCT GAATTCCATC ATCAAGGCC 89 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1 . . 84 

(D) OTHER INFORMATION: /note= "An alternative exon of 
alpha- 1C." 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
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CAC TAT TTC TGT GAT GCA TGG AAT ACA TTT GAC GCC TTG ATT GTT GTG 4 8 

His Tyr Phe Cys Asp Ala Trp Asn Thr Phe Asp Ala Leu lie Val Val 
1 5 10 15 

GGT AGC ATT GTT GAT ATA GCA ATC ACC GAG GTA AAC 84 
Gly Ser lie Val Asp lie Ala He Thr Glu Val Asn 
20 25 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7362 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 144 .. 7163 

(ix) FEATURE: 

(A) NAME/KEY: 5 ' UTR 

(B) LOCATION: 1..143 

(ix) FEATURE: 

(A) NAME/KEY: 3 'UTR 

(B) LOCATION: 7161.. 7362 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

GCGGCGGCGG CTGCGGCGGT GGGGCCGGGC GAGGTCCGTG CGGTCCCGGC GGCTCCGTGG 60 

CTGCTCCGCT CTGAGCGCCT GCGCGCCCCG CGCCCTCCCT GCCGGGGCCG CTGGGCCGGG 120 

GATGCACGCG GGGCCCGGGA GCC ATG GTC CGC TTC GGG GAC GAG CTG GGC 170 
Met Val Arg Phe Gly Asp Glu Leu Gly 
15 

GGC CGC TAT GGA GGC CCC GGC GGC GGA GAG CGG GCC CGG GGC GGC GGG 218 
Gly Arg Tyr Gly Gly Pro Gly Gly Gly Glu Arg Ala Arg Gly Gly Gly 
10 15 20 25 

GCC GGC GGG GCG GGG GGC CCG GGT CCC GGG GGG CTG CAG CCC GGC CAG 266 
Ala Gly Gly Ala Gly Gly Pro Gly Pro Gly Gly Leu Gin Pro Gly Gin 
30 35 40 

CGG GTC CTC TAC AAG CAA TCG ATC GCG CAG CGC GCG CGG ACC ATG GCG 314 
Arg Val Leu Tyr Lys Gin Ser He Ala Gin Arg Ala Arg Thr Met Ala 
45 50 " 55 

CTG TAC AAC CCC ATC CCG GTC AAG CAG AAC TGC TTC ACC GTC AAC CGC 362 
Leu Tyr Asn Pro He Pro Val Lys Gin Asn Cys Phe Thr Val Asn Arg " ~ 
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TCG CTC TTC GTC TTC AGC GAG GAC AAC GTC GTC CGC AAA TAC GCG AAG 
Ser Leu Phe Val Phe Ser Glu Asp Asn Val Val Arg Lys Tyr Ala Lys 
75 80 85 

CGC ATC ACC GAG TGG CCT CCA TTC GAG AAT ATG ATC CTG GCC ACC ATC 
Arg lie Thr Glu Trp Pro Pro Phe Glu Asn Met He Leu Ala Thr He 
90 95 100 105 

ATC GCC AAC TGC ATC GTG CTG GCC CTG GAG CAG CAC CTC CCT GAT GGG 
He Ala Asn Cys He Val Leu Ala Leu Glu Gin His Leu Pro Asp Gly 
110 115 120 

GAC AAA ACG CCC ATG TCC GAG CGG CTG GAC GAC ACG GAG CCC TAT TTC 
Asp Lys Thr Pro Met Ser Glu Arg Leu Asp Asp Thr Glu Pro Tyr Phe 
125 130 135 

ATC GGG ATC TTT TGC TTC GAG GCA GGG ATC AAA ATC ATC GCT CTG GGC 
He Gly He Phe Cys Phe Glu Ala Gly He Lys He He Ala Leu Gly 
140 145 150 

TTT GTC TTC CAC AAG GGC TCT TAC CTG CGG AAC GGC TGG AAC GTC ATG 
Phe Val Phe His Lys Gly Ser Tyr Leu Arg Asn Gly Trp Asn Val Met 
155 160 165 

GAC TTC GTG GTC GTC CTC ACA GGG ATC CTT GCC ACG GCT GGA ACT GAC 
Asp Phe Val Val Val Leu Thr Gly He Leu Ala Thr Ala Gly Thr Asp 
170 175 180 185 

TTC GAC CTG CGA ACA CTG AGG GCT GTG CGT GTG CTG AGG CCC CTG AAG 
Phe Asp Leu Arg Thr Leu Arg Ala Val Arg Val Leu Arg Pro Leu Lys 
190 195 200 

CTG GTG TCT GGG ATT CCA AGT TTG CAG GTG GTG CTC AAG TCC ATC ATG 
Leu Val Ser Gly He Pro Ser Leu Gin Val Val Leu Lys Ser He Met 
205 210 215 

AAG GCC ATG GTT CCA CTC CTG CAG ATT GGG CTG CTT CTC TTC TTT GCC 
Lys Ala Met Val Pro Leu Leu Gin He Gly Leu Leu Leu Phe Phe Ala 
220 225 230 

ATC CTC ATG TTT GCC ATC ATT GGC CTG GAG TTC TAC ATG GGC AAG TTC 
He Leu Met Phe Ala He He Gly Leu Glu Phe Tyr Met Gly Lys Phe 
235 240 245 

CAC AAG GCC TGT TTC CCC AAC AGC ACA GAT GCG GAG CCC GTG GGT GAC 
His Lys Ala Cys Phe Pro Asn Ser Thr Asp Ala Glu Pro Val Gly Asp 
250 * 255 260 265 

TTC CCC TGT GGC AAG GAG GCC CCA GCC CGG CTG TGC GAG GGC GAC ACT 
Phe Pro Cys Gly Lys Glu Ala Pro Ala Arg Leu Cys Glu Gly Asp Thr 
270 275 280 

GAG TGC CGG GAG TAC TGG CCA GGA CCC AAC TTT GGC ATC ACC AAC TTT 
Glu Cys Arg Glu Tyr Trp Pro Gly Pro Asn Phe Gly He Thr Asn Phe 
285 290 295 
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GAC AAT ATC CTG TTT GCC ATC TTG ACG GTG TTC CAG TGC ATC ACC ATG 
Asp Asn lie Leu Phe Ala lie Leu Thr Val Phe Gin Cys He Thr Met 
300 305 310 

GAG GGC TGG ACT GAC ATC CTC TAT AAT ACA AAC GAT GCG GCC GGC AAC 
Glu Gly Trp Thr Asp He Leu Tyr Asn Thr Asn Asp Ala Ala Gly Asn 
315 320 325 

ACC TGG AAC TGG CTC TAC TTC ATC CCT CTC ATC ATC ATC GGC TCC TTC 
Thr Trp Asn Trp Leu Tyr Phe He Pro Leu He He He Gly Ser Phe 
330 335 340 345 

TTC ATG CTC AAC CTG GTG CTG GGC GTG CTC TCG GGG GAG TTT GCC AAG 
Phe Met Leu Asn Leu Val Leu Gly Val Leu Ser Gly Glu Phe Ala Lys 
350 355 360 

GAG CGA GAG AGG GTG GAG AAC CGC CGC GCC TTC CTG AAG CTG CGC CGG 
Glu Arg Glu Arg Val Glu Asn Arg Arg Ala Phe Leu Lys Leu Arg Arg 
365 370 375 

CAG CAG CAG ATC GAG CGA GAG CTC AAC GGG TAC CTG GAG TGG ATC TTC 
Gin Gin Gin He Glu Arg Glu Leu Asn Gly Tyr Leu Glu Trp He Phe 
380 385 * 390 

AAG GCG GAG GAA GTC ATG CTG GCC GAG GAG GAC AGG AAT GCA GAG GAG 
Lys Ala Glu Glu Val Met Leu Ala Glu Glu Asp Arg Asn Ala Glu Glu 
395 400 405 

AAG TCC CCT TTG GAC GTG CTG AAG AGA GCG GCC ACC AAG AAG AGC AGA 
Lys Ser Pro Leu Asp Val Leu Lys Arg Ala Ala Thr Lys Lys Ser Arg 
410 415 420 425 

AAT GAC CTG ATC CAC GCA GAG GAG GGA GAG GAC CGG TTT GCA GAT CTC 
Asn Asp Leu He His Ala Glu Glu Gly Glu Asp Arg Phe Ala Asp Leu 
430 435 440 

TGT GCT GTT GGA TCC CCC TTC GCC CGC GCC AGC CTC AAG AGC GGG AAG 
Cys Ala Val Gly Ser Pro Phe Ala Arg Ala Ser Leu Lys Ser Gly Lys 
445 450 455 

ACA GAG AGC TCG TCA TAC TTC CGG AGG AAG GAG AAG ATG TTC CGG TTT 
Thr Glu Ser Ser Ser Tyr Phe Arg Arg Lys Glu Lys Met Phe Arg Phe 
460 465 470 

TTT ATC CGG CGC ATG GTG AAG GCT CAG AGC TTC TAC TGG GTG GTG CTG 
Phe lie Arg Arg Met Val Lys Ala Gin Ser Phe Tyr Trp Val Val Leu 
475 480 485 

TGC GTG GTG GCC CTG AAC ACA CTG TGT GTG GCC ATG GTG CAT TAC AAC 
Cys Val Val Ala Leu Asn Thr Leu Cys Val Ala Met Val His Tyr Asn 
490 495 500 505 

CAG CCG CGG CGG CTT ACC ACG ACC CTG TAT TTT GCA GAG TTT GTT TTC 
Gin Pro Arg Arg Leu Thr Thr Thr Leu Tyr Phe Ala Glu Phe Val Phe 
510 515 520 
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CTG GGT CTC TTC CTC ACA GAG ATG TCC CTG AAG ATG TAT GGC CTG GGG 
Leu Gly Leu Phe Leu Thr Glu Met Ser Leu Lys Met Tyr Gly Leu Gly 
525 530 535 

CCC AGA AGC TAC TTC CGG TCC TCC TTC AAC TGC TTC GAC TTT GGG GTC 
Pro Arg Ser Tyr Phe Arg Ser Ser Phe Asn Cys Phe Asp Phe Gly Val 
540 545 550 

ATC GTG GGG AGC GTC TTT GAA GTG GTC TGG GCG GCC ATC AAG CCG GGA 
lie Val Gly Ser Val Phe Glu Val Val Trp Ala Ala He Lys Pro Gly 
555 560 565 

AGC TCC TTT GGG ATC AGT GTG CTG CGG GCC CTC CGC CTG CTG AGG ATC 
Ser Ser Phe Gly He Ser Val Leu Arg Ala Leu Arg Leu Leu Arg He 
570 575 580 585 

TTC AAA GTC ACG AAG TAC TGG AGC TCC CTG CGG AAC CTG GTG GTG TCC 
Phe Lys Val Thr Lys Tyr Trp Ser Ser Leu Arg Asn Leu Val Val Ser 
590 595 600 

CTG CTG AAC TCC ATG AAG TCC ATC ATC AGC CTG CTC TTC TTG CTC TTC 
Leu Leu Asn Ser Met Lys Ser He He Ser Leu Leu Phe Leu Leu Phe 
605 610 615 

CTG TTC ATT GTG GTC TTC GCC CTG CTG GGG ATG CAG CTG TTT GGG GGA 
Leu Phe He Val Val Phe Ala Leu Leu Gly Met Gin Leu Phe Gly Gly 
620 625 630 

CAG TTC AAC TTC CAG GAT GAG ACT CCC ACA ACC AAC TTC GAC ACC TTC 
Gin Phe Asn Phe Gin Asp Glu Thr Pro Thr Thr Asn Phe Asp Thr Phe 
635 640 645 

CCT GCC GCC ATC CTC ACT GTC TTC CAG ATC CTG ACG GGA GAG GAC TGG 
Pro Ala Ala He Leu Thr Val Phe Gin He Leu Thr Gly Glu Asp Trp 
650 655 660 665 

AAT GCA GTG ATG TAT CAC GGG ATC GAA TCG CAA GGC GGC GTC AGC AAA 
Asn Ala Val Met Tyr His Gly He Glu Ser Gin Gly Gly Val Ser Lys 
670 675 680 

GGC ATG TTC TCG TCC TTT TAC TTC ATT GTC CTG ACA CTG TTC GGA AAC 
Gly Met Phe Ser Ser Phe Tyr Phe He Val Leu Thr Leu Phe Gly Asn 
685 690 695 

TAC ACT CTG CTG AAT GTC TTT CTG GCC ATC GCT GTG GAC AAC CTG GCC 
Tyr Thr Leu Leu Asn Val Phe Leu Ala He Ala Val Asp Asn Leu Ala 
700 705 710 

AAC GCC CAA GAG CTG ACC AAG GAT GAA GAG GAG ATG GAA GAA GCA GCC 
Asn Ala Gin Glu Leu Thr Lys Asp Glu Glu Glu Met Glu Glu Ala Ala 
715 720 725 

AAT CAG AAG CTT GCT CTG CAA AAG GCC AAA GAA GTG GCT GAA GTC AGC 
Asn Gin Lys Leu Ala Leu Gin Lys Ala Lys Glu Val Ala Glu Val Ser 
730 " 735 740 745 
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CCC ATG TCT GCC GCG AAC ATC TCC ATC GCC GCC AGG CAG CAG AAC TCG 
Pro Met Ser Ala Ala Asn He Ser He Ala Ala Arg Gin Gin Asn Ser 
750 755 ~ 760 

GCC AAG GCG CGC TCG GTG TGG GAG CAG CGG GCC AGC CAG CTA CGG CTG 
Ala Lys Ala Arg Ser Val Trp Glu Gin Arg Ala Ser Gin Leu Arg Leu 
765 770 775 

CAG AAC CTG CGG GCC AGC TGC GAG GCG CTG TAC AGC GAG ATG GAC CCC 
Gin Asn Leu Arg Ala Ser Cys Glu Ala Leu Tyr Ser Glu Met Asp Pro 
780 785 790 

GAG GAG CGG CTG CGC TTC GCC ACT ACG CGC CAC CTG CGG CCC GAC ATG 
Glu Glu Arg Leu Arg Phe Ala Thr Thr Arg His Leu Arg Pro Asp Met 
? 95 800 805 

AAG ACG CAC CTG GAC CGG CCG CTG GTG GTG GAG CTG GGC CGC GAC GGC 
Lys Thr His Leu Asp Arg Pro Leu Val Val Glu Leu Gly Arg Asp Glv 
810 815 820 825 

GCG CGG GGG CCC GTG GGA GGC AAA GCC CGA CCT GAG GCT GCG GAG GCC 
Ala Arg Gly Pro Val Gly Gly Lys Ala Arg Pro Glu Ala Ala Glu Ala 
830 835 840 

CCC GAG GGC GTC GAC CCT CCG CGC AGG CAC CAC CGG CAC CGC GAC AAG 
Pro Glu Gly Val Asp Pro Pro Arg Arg His His Arg His Arg Asp Lys 
845 850 855 

GAC AAG ACC CCC GCG GCG GGG GAC CAG GAC CGA GCA GAG GCC CCG AAG 
Asp Lys Thr Pro Ala Ala Gly Asp Gin Asp Arg Ala Glu Ala Pro Lys 
860 865 870 

GCG GAG AGC GGG GAG CCC GGT GCC CGG GAG GAG CGG CCG CGG CCG CAC 
Ala Glu Ser Gly Glu Pro Gly Ala Arg Glu Glu Arg Pro Arg Pro His 
875 880 885 

CGC AGC CAC AGC AAG GAG GCC GCG GGG CCC CCG GAG GCG CGG AGC GAG 
Arg Ser His Ser Lys Glu Ala Ala Gly Pro Pro Glu Ala Arg Ser Glu 
890 895 900 905 

CGC GGC CGA GGC CCA GGC CCC GAG GGC GGC CGG CGG CAC CAC CGG CGC 
Arg Gly Arg Gly Pro Gly Pro Glu Gly Gly Arg Arg His His Arg Arg 
910 915 920 

GGC TCC CCG GAG GAG GCG GCC GAG CGG GAG CCC CGA CGC CAC CGC GCG 
Gly Ser Pro Glu Glu Ala Ala Glu Arg Glu Pro Arg Arg His Arg Ala 
925 930 93 5 

CAC CGG CAC CAG GAT CCG AGC AAG GAG TGC GCC GGC GCC AAG GGC GAG 
His Arg His Gin Asp Pro Ser Lys Glu Cys Ala Gly Ala Lys Gly Glu 
940 945 950 

CGG CGC GCG CGG CAC CGC GGC GGC CCC CGA GCG GGG CCC CGG GAG GCG 
Arg Arg Ala Arg His Arg Gly Gly Pro Arg Ala Gly Pro Arg Glu Ala 
955 960 965 
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GAG AGC GGG GAG GAG CCG GCG CGG CGG CAC CGG GCC CGG CAC AAG GCG 3098 
Glu Ser Gly Glu Glu Pro Ala Arg Arg His Arg Ala Arg His Lys Ala 
970 975 980 985 

CAG CCT GCT CAC GAG GCT GTG GAG AAG GAG ACC ACG GAG AAG GAG GCC 3 146 

Gin Pro Ala His Glu Ala Val Glu Lys Glu Thr Thr Glu Lys Glu Ala 
990 995 1000 

ACG GAG AAG GAG GCT GAG ATA GTG GAA GCC GAC AAG GAA AAG GAG CTC 3194 
Thr Glu Lys Glu Ala Glu lie Val Glu Ala Asp Lys Glu Lys Glu Leu 
1005 1010 ' 1015 

CGG AAC CAC CAG CCC CGG GAG CCA CAC TGT GAC CTG GAG ACC AGT GGG 3242 
Arg Asn His Gin Pro Arg Glu Pro His Cys Asp Leu Glu Thr Ser Gly 
1020 1025 1030 

ACT GTG ACT GTG GGT CCC ATG CAC ACA CTG CCC AGC ACC TGT CTC CAG 3290 
Thr Val Thr Val Gly Pro Met His Thr Leu Pro Ser Thr Cys Leu Gin 
1035 1040 1045 

AAG GTG GAG GAA CAG CCA GAG GAT GCA GAC AAT CAG CGG AAC GTC ACT 3338 
Lys Val Glu Glu Gin Pro Glu Asp Ala Asp Asn Gin Arg Asn Val Thr 
1050 1055 1060 1065 

CGC ATG GGC AGT CAG CCC CCA GAC CCG AAC ACT ATT GTA CAT ATC CCA 3386 
Arg Met Gly Ser Gin Pro Pro Asp Pro Asn Thr He Val His He Pro 
1070 1075 1080 

GTG ATG CTG ACG GGC CCT CTT GGG GAA GCC ACG GTC GTT CCC AGT GGT 3434 
Val Met Leu Thr Gly Pro Leu Gly Glu Ala Thr Val Val Pro Ser Gly 
1085 1090 1095 

AAC GTG GAC CTG GAA AGC CAA GCA GAG GGG AAG AAG GAG GTG GAA GCG 34 82 

Asn Val Asp Leu Glu Ser Gin Ala Glu Gly Lys Lys Glu Val Glu Ala 
1100 1105 1110 

GAT GAC GTG ATG AGG AGC GGC CCC CGG CCT ATC GTC CCA TAC AGC TCC 353 0 

Asp Asp Val Met Arg Ser Gly Pro Arg Pro He Val Pro Tyr Ser Ser 
1115 1120 1125 

ATG TTC TGT TTA AGC CCC ACC AAC CTG CTC CGC CGC TTC TGC CAC TAC 3 57 8 

Met Phe Cys Leu Ser Pro Thr Asn Leu Leu Arg Arg Phe Cys His Tyr 
1130 1135 1140 * 1145 

ATC GTG ACC ATG AGG TAC TTC GAG GTG GTC ATT CTC GTG GTC ATC GCC 362 6 

He Val Thr Met Arg Tyr Phe Glu Val Val He Leu Val Val He Ala 
1150 1155 1160 

TTG AGC AGC ATC GCC CTG GCT GCT GAG GAC CCA GTG CGC ACA GAC TCG 3674 
Leu Ser Ser He Ala Leu Ala Ala Glu Asp Pro Val Arg Thr Asp Ser 
1165 1170 1175 

CCC AGG AAC AAC GCT CTG AAA TAC CTG GAT TAC ATT TTC ACT GGT GTC 3 722 

Pro Arg Asn Asn Ala Leu Lys Tyr Leu Asp Tyr He Phe Thr Gly Val 
1180 1185 1190 
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TTT ACC TTT GAG ATG GTG ATA AAG ATG ATC GAC TTG GGA CTG CTG CTT 
Phe Thr Phe Glu Met Val lie Lys Met lie Asp Leu Gly Leu Leu Leu 
1195 1200 1205 

CAC CCT GGA GCC TAT TTC CGG GAC TTG TGG AAC ATT CTG GAC TTC ATT 
His Pro Gly Ala Tyr Phe Arg Asp Leu Trp Asn lie Leu Asp Phe lie 
1210 1215 1220 1225 

GTG GTC AGT GGC GCC CTG GTG GCG TTT GCT TTC TCA GGA TCC AAA GGG 
Val Val Ser Gly Ala Leu Val Ala Phe Ala Phe Ser Gly Ser Lys Gly 
1230 1235 1240 

AAA GAC ATC AAT ACC ATC AAG TCT CTG AGA GTC CTT CGT GTC CTG CGG 
Lys Asp He Asn Thr He Lys Ser Leu Arg Val Leu Arg Val Leu Arg 
1245 1250 1255 

CCC CTC AAG ACC ATC AAA CGG CTG CCC AAG CTC AAG GCT GTG TTT GAC 
Pro Leu Lys Thr He Lys Arg Leu Pro Lys Leu Lys Ala Val Phe Asp 
1260 1265 1270 

TGT GTG GTG AAC TCC CTG AAG AAT GTC CTC AAC ATC TTG ATT GTC TAC 
Cys Val Val Asn Ser Leu Lys Asn Val Leu Asn He Leu He Val Tyr 
1275 1280 1285 

ATG CTC TTC ATG TTC ATA TTT GCC GTC ATT GCG GTG CAG CTC TTC AAA 
Met Leu Phe Met Phe He Phe Ala Val He Ala Val Gin Leu Phe Lys 
1290 1295 1300 1305 

GGG AAG TTT TTC TAC TGC ACA GAT GAA TCC AAG GAG CTG GAG AGG GAC 
Gly Lys Phe Phe Tyr Cys Thr Asp Glu Ser Lys Glu Leu Glu Arg Asp 
1310 1315 1320 

TGC AGG GGT CAG TAT TTG GAT TAT GAG AAG GAG GAA GTG GAA GCT CAG 
Cys Arg Gly Gin Tyr Leu Asp Tyr Glu Lys Glu Glu Val Glu Ala Gin 
1325 1330 1335 

CCC AGG CAG TGG AAG AAA TAC GAC TTT CAC TAC GAC AAT GTG CTC TGG 
Pro Arg Gin Trp Lys Lys Tyr Asp Phe His Tyr Asp Asn Val Leu Trp 
1340 1345 1350 

GCT CTG CTG ACG CTG TTC ACA GTG TCC ACG GGA GAA GGC TGG CCC ATG 
Ala Leu Leu Thr Leu Phe Thr Val Ser Thr Gly Glu Gly Trp Pro Met 
1355 1360 1365 

GTG CTG AAA CAC TCC GTG GAT GCC ACC TAT GAG GAG CAG GGT CCA AGC 
Val Leu Lys His Ser Val Asp Ala Thr Tyr Glu Glu Gin Gly Pro Ser 
1370 1375 1380 1385 

CCT GGG TAC CGC ATG GAG CTG TCC ATC TTC TAC GTG GTC TAC TTT GTG 
Pro Gly Tyr Arg Met Glu Leu Ser He Phe Tyr Val Val Tyr Phe Val 
1390 1395 1400 

GTC TTT CCC TTC TTC TTC GTC AAC ATC TTT GTG GCT TTG ATC ATC ATC 
Val Phe Pro Phe Phe Phe Val Asn He Phe Val Ala Leu He He He 
1405 1410 1415 
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ACC TTC CAG GAG CAG GGG GAC AAG GTG ATG TCT GAA TGC AGC CTG GAG 
Thr Phe Gin Glu Gin Gly Asp Lys Val Met Ser Glu Cys Ser Leu Glu 
1420 1425 1430 

AAG AAC GAG AGG GCT TGC ATT GAC TTC GCC ATC AGC GCC AAA CCC CTG 
Lys Asn Glu Arg Ala Cys He Asp Phe Ala He Ser Ala Lys Pro Leu 
1435 1440 1445 

ACA CGG TAC ATG CCC CAA AAC CGG CAG TCG TTC CAG TAT AAG ACG TGG 
Thr Arg Tyr Met Pro Gin Asn Arg Gin Ser Phe Gin Tyr Lys Thr Tip 
1450 1455 1460 1465 

ACA TTT GTG GTC TCC CCG CCC TTT GAA TAC TTC ATC ATG GCC ATG ATA 
Thr Phe Val Val Ser Pro Pro Phe Glu Tyr Phe He Met Ala Met He 
1470 1475 1480 

GCC CTC AAC ACT GTG GTG CTG ATG ATG AAG TTC TAT GAT GCA CCC TAT 
Ala Leu Asn Thr Val Val Leu Met Met Lys Phe Tyr Asp Ala Pro Tyr 
1485 1490 1495 

GAG TAC GAG CTG ATG CTG AAA TGC CTG AAC ATC GTG TTC ACA TCC ATG 
Glu Tyr Glu Leu Met Leu Lys Cys Leu Asn He Val Phe Thr Ser Met 
1500 1505 1510 

TTC TCC ATG GAA TGC GTG CTG AAG ATC ATC GCC TTT GGG GTG CTG AAC 
Phe Ser Met Glu Cys Val Leu Lys He He Ala Phe Gly Val Leu Asn 
1515 1520 1525 

TAT TTC AGA GAT GCC TGG AAT GTC TTT GAC TTT GTC ACT GTG TTG GGA 
Tyr Phe Arg Asp Ala Trp Asn Val Phe Asp Phe Val Thr Val Leu Gly 
1530 1535 1540 1545 

AGT ATT ACT GAT ATT TTA GTA ACA GAG ATT GCG GAA ACG AAC AAT TTC 
Ser He Thr Asp He Leu Val Thr Glu He Ala Glu Thr Asn Asn Phe 
1550 1555 1560 

ATC AAC CTC AGC TTC CTC CGC CTC TTT CGA GCT GCG CGG CTG ATC AAG 
He Asn Leu Ser Phe Leu Arg Leu Phe Arg Ala Ala Arg Leu He Lys 
1565 1570 1575 

CTG CTC CGC CAG GGC TAC ACC ATC CGC ATC CTG CTG TGG ACC TTT GTC 
Leu Leu Arg Gin Gly Tyr Thr He Arg He Leu Leu Trp Thr Phe Val 
1580 ' ' 1585 1590 

CAG TCC TTC AAG GCC CTG CCC TAC GTG TGT CTG CTC ATT GCC ATG CTG 
Gin Ser Phe Lys Ala Leu Pro Tyr Val Cys Leu Leu He Ala Met Leu 
1595 1600 1605 

TTC TTC ATC TAC GCC ATC ATC GGC ATG CAG GTG TTT GGG AAT ATT GCC 
Phe Phe He Tyr Ala lie He Gly Met Gin Val Phe Gly Asn He Ala 
1610 1615 1620 1625 



5066 
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S? C t CTG CTG CTG TTC AGG AGC GCC ACG GGG °AG GCC TGG 

Leu Gin Ala Leu Met Leu Leu Phe Arg Ser Ala Thr Gly Glu Ala Trp 
1645 1650 1655 

Sis Glu «f mI G t CTG I CC ^ GC CTG AGC ^ ^ GCC TGT GAT GAG CAG 
His Glu lie Met Leu Ser Cys Leu Ser Asn Gin Ala Cys Asp Glu Gin 
1660 1665 16 70 

GCC AAT GCC ACC GAG TGT GGA AGT GAC TTT GCC TAC TTC TAC TTC GTC 
Ala Asn Ala Thr Glu Cys Gly Ser Asp Phe Ala Tyr III vll 
1675 1680 1685 

sSr Phf ^ Shf t CTG I GC I CC T P CTG ATG TTG ^ CTC TT T GTG OCT 
ffi^n S LeU 0X8 Ser Phe Leu Met Leu Asn Leu Phe Val Ala 

1690 1695 1700 ivos 

vl? tT C « TG GAC m ^ TAC CTC AGG CGG GAC TCT TCC ATC CTA 

Val He Met Asp Asn Phe Glu Tyr Leu Thr Arg Asp Ser Ser lie Leu 
1710 1715 1720 

£J So 2S H A c ESS GAT GAG U C A ?C~CGG GTC TGG GCT GAA TAC GAC 
Qly Pro His his Leu Asp Glu Phe He Arg Val Trp Ala Glu Tyr Asp 
1725 1730 1735 

GGG GC * ? CG TGT GGG CGC ATC AGT TAC AAT GAC ATG TTT GAG ATG CTG 
Pro Ala Ala cys Gly Arg He Ser Tyr Asn- Asp Met Phe Glu Met Leu 
l' 40 1745 1750 

AAA CAC ATG TCC CCG CCT CTG GGG CTG GGG AAG AAA TGC CCT GCT CGA 
Lys His Met Ser Pro Pro Leu Gly Leu Gly Lys Lys Cys Pro Ala Arg 
1755 1760 1765 

%H GCT I AC **° CGC CTG GTT CGC ATG ** C ATG CCC ATC TCC AAC GAG 
Val Ala Tyr Lys Arg Leu Val Arg Met Asn Met Pro lie Ser Asn Glu 
1770 1775 1780 1785 

a— mI G £ CT 5*7 5 AC 1X0 ACG TCC ACG CTG ATG GCC CTC ATC CGG ACG 
Asp Met Thr Val His Phe Thr Ser Thr Leu Met Ala Leu He Arg Thr 
1? 90 1795 1800 

GCA CTG GAG ATC AAG CTG GCC CCA GCT GGG ACA AAG CAG CAT CAG TGT 
Ala Leu Glu He Lys Leu Ala Pro Ala Gly Thr Lys Gin His Gin Cys 
1805 1810 1815 

GAC GCG GAG TTG AGG AAG GAG ATT TCC GTT GTG TGG GCC AAT CTG CCC 
Asp Ala Glu Leu Arg Lys Glu He Ser Val Val Trp Ala Asn Leu Pro 
1820 1825 1830 

CAG AAG ACT TTG GAC TTG CTG GTA CCA CCC CAT AAG CCT GAT GAG ATG 
Gin Lys Thr Leu Asp Leu Leu Val Pro Pro His Lys Pro Asp Glu Met 
1835 1840 1845 

ACA GTG GGG AAG GTT TAT GCA GCT CTG ATG ATA TTT GAC TTC TAC AAG 
?nL Val Gly LyS Val T * r Ala Ala Leu Met Ile phe Asp Tyr Lys 

1850 1855 i860 1865 
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CAG AAC AAA ACC ACC AGA GAC CAG ATG CAG CAG GCT CCT GGA GGC CTC 
Gin Asn Lys Thr Thr Arg Asp Gin Met Gin Gin Ala Pro Gly Gly Leu 
1870 1875 1880 

TCC CAG ATG GGT CCT GTG TCC CTG TTC CAC CCT CTG AAG GCC ACC CTG 
Ser Gin Met Gly Pro Val Ser Leu Phe His Pro Leu Lys Ala Thr Leu 
1885 1890 1895 

GAG CAG ACA CAG CCG GCT GTG CTC CGA GGA GCC CGG GTT TTC CTT CGA 
Glu Gin Thr Gin Pro Ala Val Leu Arg Gly Ala Arg Val Phe Leu Arg 
1900 1905 1910 

CAG AAG AGT TCC ACC TCC CTC AGC AAT GGC GGG GCC ATA CAA AAC CAA 
Gin Lys Ser Ser Thr Ser Leu Ser Asn Gly Gly Ala lie Gin Asn Gin 
1915 1920 1925 

GAG AGT GGC ATC AAA GAG TCT GTC TCC TGG GGC ACT CAA AGG ACC CAG 
Glu Ser Gly He Lys Glu Ser Val Ser Trp Gly Thr Gin Arg Thr Gin 
1930 1935 1940 1945 



GAT 
Asp 


GCA 
Ala 


CCC 
Pro 


CAT GAG GCC 
His GlU Ala 
1950 


AGG 
Arg 


CCA 

Pro 


CCC 
Pro 


CTG GAG CGT 
Leu Glu Arg 
1955 


GGC CAC 
Gly His 


TCC ACA 
Ser Thr 
1960 


6026 


GAG 
Glu 


ATC 
He 


CCT 
Pro 


GTG GGG CGG 
Val Gly Arg 
1965 


TCA 

Ser 


GGA 
Gly 


GCA CTG GCT 
Ala Leu Ala 
1970 


GTG 
Val 


GAC GTT CAG 
Asp Val Gin 
1975 


ATG 
Met 


6074 


CAG 
Gin 


AGC 
Ser 


ATA ACC CGG AGG 
He Thr Arg Arg 
1980 


GGC 
Gly 


CCT GAT GGG GAG 
Pro Asp Gly Glu 
1985 


CCC 
Pro 


CAG CCT 
Gin Pro 
1990 


GGG 
Gly 


CTG 
Leu 


6122 


GAG 
Glu 


AGC CAG 
Ser Gin 
1995 


GGT CGA GCG 
Gly Arg Ala 


GCC TCC 
Ala Ser 
2000 


ATG 
Met 


CCC CGC 
Pro Arg 


CTT GCG GCC 
Leu Ala Ala 
2005 


GAG 
Glu 


ACT 
Thr 


6170 


CAG CCC 
Gin Pro 
2010 


GTC 
Val 


ACA GAT GCC AGC 
Thr Asp Ala Ser 
2015 


CCC 
Pro 


ATG 
Met 


AAG CGC TCC 
Lys Arg Ser 
2020 


ATC TCC 
He Ser 


ACG 
Thr 


CTG 
Leu 
2025 


6218 


GCC 
Ala 


CAG 
Gin 


CGG 
Arg 


CCC CGT GGG 
Pro Arg Gly 
2030 


ACT 
Thr 


CAT 
His 


CTT 
Leu 


TGC AGC 
Cys Ser 
2035 


ACC 
Thr 


ACC CCG 
Thr Pro 


GAC CGC 
Asp Arg 
2040 


6266 


CCA 
Pro 


CCC 
Pro 


CCT 
Pro 


AGC CAG GCG 
Ser Gin Ala 
2045 


TCG 
Ser 


TCG 
Ser 


CAC CAC CAC 
His His His 
2050 


CAC 
His 


CAC CGC TGC 
His Arg Cys 
2055 


CAC 
His 


6314 


CGC 
Arg 


CGC 
Arg 


AGG GAC AGG AAG 
Arg Asp Arg Lys 
2060 


CAG 
Gin 


AGG TCC 
Arg ser 
2065 


CTG GAG 
Leu Glu 


AAG 
Lys 


GGG CCC 
Gly Pro 
2070 


AGC 
Ser 


CTG 
Leu 


6362 


TCT 

Ser 


GCC GAT 
Ala Asp 
2075 


ATG GAT GGC GCA CCA 
Met Asp Gly Ala Pro 
2080 


AGC 
Ser 


AGT GCT 
Ser Ala 


GTG GGG CCG GGG 
Val Gly Pro Gly 
2085 


CTG 
Leu 


6410 
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s s s a s gs s s s s s s s 5 s 

2100 2105 

SS SS S S EE SS SS S S SS E IS 5 E S 

2115 2120 

s ss ass s e E gj ~ « s - « s K 

2130 2135 

E E SS E E E E E S E 2S S S E S S 

2145 2 150 

s s ss ss ^ s* s as s s s es e s s 

S E E E S SS E S S E K S SS S 

2175 2180 S 2185 

?S S S SS EE SS S E E £ E SS E E S 

2155 2200 

S SS S SS S E S EE SS E S SS SS 
S E EE S E E ESS E E SSE E 
SS KS S E E H5 S E E E SSE E S 
E S S S S EE SS E SS 252 S S S 
S SS S E EE E E E SS EE E S E K 

2270 2275 2280 

QlS A1°J v T ? A E C AAC TCG GGC CGC TCC TC C AGG ACT TCC TAG GTG 

Glu Ala Val Ala Thr Asn Ser Gly Arg Ser Ser Arg Thr Ser T?r Si 
2285 2290 2295 

Ser H£ t CTG ^ C TCC TCT ^ CCT CTC CGC CGC GTG CCC AAC GGT 

Ser ser Leu^Thr Ser Gin Ser His^Pro Leu Arg Arg J.1 P» JSS Sty 
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TAC CAC TGC ACC CTG GGA CTC AGC TCG GGT GGC CGA GCA CGG CAC AGC 7130 
Tyr His Cys Thr Leu Gly Leu Ser Ser Gly Gly Arg Ala Arg His Ser 
2315 2320 2325 

TAC CAC CAC CCT GAC CAA GAC CAC TGG TGC TAGCTGCACC GTGACCGCTC 7180 
Tyr His His Pro Asp Gin Asp His Trp Cys 
2330 2335 234 

AGACGCCTGC ATGCAGCAGG CGTGTGTTCC AGTGGATGAG TTTTATCATC CACACGGGGC 7240 
AGTCGGCCCT CGGGGGAGGC CTTGCCCACC TTGGTGAGGC TCCTGTGGCC CCTCCCTCCC 7300 
CCTCCTCCCC TCTTTTACTC TAGACGACGA ATAAAGCCCT GTTGCTTGAG TGTACGTACC 7360 
GC 7362 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) length: 7175 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 144.. 6857 

(ix) FEATURE: 

(A) NAME/KEY: 5 ' UTR 

(B) LOCATION: 1..143 

(ix) FEATURE: 

(A) NAME/KEY: 3 'UTR 

(B) LOCATION: 6855.. 7175 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

GCGGCGGCGG CTGCGGCGGT GGGGCCGGGC GAGGTCCGTG CGGTCCCGGC GGCTCCGTGG 60 

CTGCTCCGCT CTGAGCGCCT GCGCGCCCCG CGCCCTCCCT GCCGGGGCCG CTGGGCCGGG 12 0 

GATGCACGCG GGGCCCGGGA GCC ATG GTC CGC TTC GGG GAC GAG CTG GGC 170 
Met Val Arg Phe Gly Asp Glu Leu Gly 
1 5 

GGC CGC TAT GGA GGC CCC GGC GGC GGA GAG CGG GCC CGG GGC GGC GGG 218 
Gly Arg Tyr Gly Gly Pro Gly Gly Gly Glu Arg Ala Arg Gly Gly Gly 
10 15 20 25 



GCC GGC GGG GCG GGG GGC CCG GGT CCC GGG GGG CTG CAG CCC GGC CAG 266 
Ala Gly Gly Ala Gly Gly Pro Gly Pro Gly Gly Leu Gin Pro Gly Gin 
30 35 40 
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CGG GTC CTC TAC AAG CAA TCG ATC GCG CAG CGC GCG CGG ACC ATG GCG 
Arg Val Leu Tyr Lys Gin Ser lie Ala Gin Arg Ala Arg Thr Met Ala 
45 50 ' 55 

CTG TAC AAC CCC ATC CCG GTC AAG CAG AAC TGC TTC ACC GTC AAC CGC 
Leu Tyr Asn Pro He Pro Val Lys Gin Asn Cys Phe Thr Val Asn Arg 
60 65 70 

TCG CTC TTC GTC TTC AGC GAG GAC AAC GTC GTC CGC AAA TAC GCG AAG 
Ser Leu Phe Val Phe Ser Glu Asp Asn Val Val Arg Lys Tyr Ala Lys 
75 80 85 

CGC ATC ACC GAG TGG CCT CCA TTC GAG AAT ATG ATC CTG GCC ACC ATC 
Arg He Thr Glu Trp Pro Pro Phe Glu Asn Met He Leu Ala Thr He 
90 95 100 105 

ATC GCC AAC TGC ATC GTG CTG GCC CTG GAG CAG CAC CTC CCT GAT GGG 
He Ala Asn Cys He Val Leu Ala Leu Glu Gin His Leu Pro Asp Gly 
HO 115 120 

GAC AAA ACG CCC ATG TCC GAG CGG CTG GAC GAC ACG GAG CCC TAT TTC 
Asp Lys Thr Pro Met Ser Glu Arg Leu Asp Asp Thr Glu Pro Tyr Phe 
125 130 135 

ATC GGG ATC TTT TGC TTC GAG GCA GGG ATC AAA ATC ATC GCT CTG GGC 
He Gly He Phe Cys Phe Glu Ala Gly He Lys He He Ala Leu Gly 
140 145 i 50 

TTT GTC TTC CAC AAG GGC TCT TAC CTG CGG AAC GGC TGG AAC GTC ATG 
Phe Val Phe His Lys Gly Ser Tyr Leu Arg Asn Gly Trp Asn Val Met 
155 160 165 

GAC TTC GTG GTC GTC CTC ACA GGG ATC CTT GCC ACG GCT GGA ACT GAC 
Asp Phe Val Val Val Leu Thr Gly He Leu Ala Thr Ala Gly Thr Asp 
170 175 180 185 

TTC GAC CTG CGA ACA CTG AGG GCT GTG CGT GTG CTG AGG CCC CTG AAG 
Phe Asp Leu Arg Thr Leu Arg Ala Val Arg Val Leu Arg Pro Leu Lys 
190 195 200 

CTG GTG TCT GGG ATT CCA AGT TTG CAG GTG GTG CTC AAG TCC ATC ATG 
Leu Val Ser Gly He Pro Ser Leu Gin Val Val Leu Lys Ser He Met 
205 210 215 

AAG GCC ATG GTT CCA CTC CTG CAG ATT GGG CTG CTT CTC TTC TTT GCC 
Lys Ala Met Val Pro Leu Leu Gin He Gly Leu Leu Leu Phe Phe Ala 
220 225 230 

ATC CTC ATG TTT GCC ATC ATT GGC CTG GAG TTC TAC ATG GGC AAG TTC 
He Leu Met Phe Ala He He Gly Leu Glu Phe Tyr Met Gly Lys Phe 
235 240 245 

CAC AAG GCC TGT TTC CCC AAC AGC ACA GAT GCG GAG CCC GTG GGT GAC 
His Lys Ala Cys Phe Pro Asn Ser Thr Asp Ala Glu Pro Val Gly Asp 
250 255 260 265 
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TTC CCC TGT GGC AAG GAG GCC CCA GCC CGG CTG TGC GAG GGC GAC ACT 
Phe Pro Cys Gly Lys Glu Ala Pro Ala Arg Leu Cys Glu Gly Asp Thr 
270 275 280 

GAG TGC CGG GAG TAC TGG CCA GGA CCC AAC TTT GGC ATC ACC AAC TTT 
Glu Cys Arg Glu Tyr Trp Pro Gly Pro Asn Phe Gly lie Thr Asn Phe 
285 290 295 

GAC AAT ATC CTG TTT GCC ATC TTG ACG GTG TTC CAG TGC ATC ACC ATG 
Asp Asn lie Leu Phe Ala lie Leu Thr Val Phe Gin Cys lie Thr Met 
300 305 310 

GAG GGC TGG ACT GAC ATC CTC TAT AAT ACA AAC GAT GCG GCC GGC AAC 
Glu Gly Trp Thr Asp lie Leu Tyr Asn Thr Asn Asp Ala Ala Gly Asn 
315 320 325 

ACC TGG AAC TGG CTC TAC TTC ATC CCT CTC ATC ATC ATC GGC TCC TTC 
Thr Trp Asn Trp Leu Tyr Phe He Pro Leu He He He Gly Ser Phe 
330 335 340 345 

TTC ATG CTC AAC CTG GTG CTG GGC GTG CTC TCG GGG GAG TTT GCC AAG 
Phe Met Leu Asn Leu Val Leu Gly Val Leu Ser Gly Glu Phe Ala Lys 
350 355 360 

GAG CGA GAG AGG GTG GAG AAC CGC CGC GCC TTC CTG AAG CTG CGC CGG 
Glu Arg Glu Arg Val Glu Asn Arg Arg Ala Phe Leu Lys Leu Arg Arg 
365 370 375 

CAG CAG CAG ATC GAG CGA GAG CTC AAC GGG TAC CTG GAG TGG ATC TTC 
Gin Gin Gin He Glu Arg Glu Leu Asn Gly Tyr Leu Glu Trp He Phe 
380 385 390 

AAG GCG GAG GAA GTC ATG CTG GCC GAG GAG GAC AGG AAT GCA GAG GAG 
Lys Ala Glu Glu Val Met Leu Ala Glu Glu Asp Arg Asn Ala Glu Glu 
395 400 405 

AAG TCC CCT TTG GAC GTG CTG AAG AGA GCG GCC ACC AAG AAG AGC AGA 
Lys Ser Pro Leu Asp Val Leu Lys Arg Ala Ala Thr Lys Lys Ser Arg 
410 415 420 425 

AAT GAC CTG ATC CAC GCA GAG GAG GGA GAG GAC CGG TTT GCA GAT CTC 
Asn Asp Leu He His Ala Glu Glu Gly Glu Asp Arg Phe Ala Asp Leu 
430 435 440 

TGT GCT GTT GGA TCC CCC TTC GCC CGC GCC AGC CTC AAG AGC GGG AAG 
Cvs Ala Val Glv Ser Pro Phe Ala Arg Ala Ser Leu Lys Ser Gly Lys 
445 450 455 

ACA GAG AGC TCG TCA TAC TTC CGG AGG AAG GAG AAG ATG TTC CGG TTT 
Thr Glu Ser Ser Ser Tyr Phe Arg Arg Lys Glu Lys Met Phe Arg Phe 
460 465 470 

TTT ATC CGG CGC ATG GTG AAG GCT CAG AGC TTC TAC TGG GTG GTG CTG 
Phe He Arg Arg Met Val Lys Ala Gin Ser Phe Tyr Trp Val Val Leu 
475 480 485 
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TGC GTG GTG GCC CTG AAC ACA CTG TGT GTG GCC ATG GTG CAT TAC AAC 
Cys Val Val Ala Leu Asn Thr Leu Cys Val Ala Met Val His Tyr Asn 
490 495 500 505 

CAG CCG CGG CGG CTT ACC ACG ACC CTG TAT TTT GCA GAG TTT GTT TTC 
Gin Pro Arg Arg Leu Thr Thr Thr Leu Tyr Phe Ala Glu Phe Val Phe 
51 ° 515 520 

CTG GGT CTC TTC CTC ACA GAG ATG TCC CTG AAG ATG TAT GGC CTG GGG 
Leu Gly Leu Phe Leu Thr Glu Met Ser Leu Lys Met Tyr Gly Leu Gly 
525 530 535 

CCC AGA AGC TAC TTC CGG TCC TCC TTC AAC TGC TTC GAC TTT GGG GTC 
Pro Arg Ser Tyr Phe Arg Ser Ser Phe Asn Cys Phe Asp Phe Gly Val 
540 545 550 

ATC GTG GGG AGC GTC TTT GAA GTG GTC TGG GCG GCC ATC AAG CCG GGA 
lie Val Gly Ser Val Phe Glu Val Val Trp Ala Ala lie Lys Pro Gly 
555 560 5S5 

TCC TTT GGG ATC AGT GTG CTG CGG GCC CTC CGC CTG CTG AGG ATC 
Ser Ser Phe Gly He Ser Val Leu Arg Ala Leu Arg Leu Leu Arg He 
570 575 580 585 

TTC AAA GTC ACG AAG TAC TGG AGC TCC CTG CGG AAC CTG GTG GTG TCC 
Phe Lys Val Thr Lys Tyr Trp Ser Ser Leu Arg Asn Leu Val Val Ser 
590 595 600 

CTG CTG AAC TCC ATG AAG TCC ATC ATC AGC CTG CTC TTC TTG CTC TTC 
Leu Leu Asn Ser Met Lys Ser He He Ser Leu Leu Phe Leu Leu Phe 
605 610 615 

CTG TTC ATT GTG GTC TTC GCC CTG CTG GGG ATG CAG CTG TTT GGG GGA 
Leu Phe He Val Val Phe Ala Leu Leu Gly Met Gin Leu Phe Gly Gly 
620 625 630 

CAG TTC AAC TTC CAG GAT GAG ACT CCC ACA ACC AAC TTC GAC ACC TTC 
Gin Phe Asn Phe Gin Asp Glu Thr Pro Thr Thr Asn Phe Asp Thr Phe 
635 640 645 

CCT GCC GCC ATC CTC ACT GTC TTC CAG ATC CTG ACG GGA GAG GAC TGG 
Pro Ala Ala He Leu Thr Val Phe Gin He Leu Thr Gly Glu Asp Trp 
650 655 660 665 

AAT GCA GTG ATG TAT CAC GGG ATC GAA TCG CAA GGC GGC GTC AGC AAA 
Asn Ala Val Met Tyr His Gly He Glu Ser Gin Gly Gly Val Ser Lys 
670 675 680 

GGC ATG TTC TCG TCC TTT TAC TTC ATT GTC CTG ACA CTG TTC GGA AAC 
Gly Met Phe Ser Ser Phe Tyr Phe He Val Leu Thr Leu Phe Gly Asn 
685 690 695 

TAC ACT CTG CTG AAT GTC TTT CTG GCC ATC GCT GTG GAC AAC CTG GCC 
Tyr Thr Leu Leu Asn Val Phe Leu Ala He Ala Val Asp Asn Leu Ala 
700 705 710 
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AAC GCC CAA GAG CTG ACC AAG GAT GAA GAG GAG ATG GAA GAA GCA GCC 
Asn Ala Gin Glu Leu Thr Lys Asp Glu Glu Glu Met Glu Glu Ala Ala 
715 720 725 

AAT CAG AAG CTT GCT CTG CAA AAG GCC AAA GAA GTG GCT GAA GTC AGC 
Asn Gin Lys Leu Ala Leu Gin Lys Ala Lys Glu Val Ala Glu Val Ser 
730 735 740 745 

CCC ATG TCT GCC GCG AAC ATC TCC ATC GCC GCC AGG CAG CAG AAC TCG 
Pro Met Ser Ala Ala Asn lie Ser lie Ala Ala Arg Gin Gin Asn Ser 
750 755 760 

GCC AAG GCG CGC TCG GTG TGG GAG CAG CGG GCC AGC CAG CTA CGG CTG 
Ala Lys Ala Arg Ser Val Trp Glu Gin Arg Ala Ser Gin Leu Arg Leu 
765 770 775 

CAG AAC CTG CGG GCC AGC TGC GAG GCG CTG TAC AGC GAG ATG GAC CCC 
Gin Asn Leu Arg Ala Ser Cys Glu Ala Leu Tyr Ser Glu Met Asp Pro 
780 785 790 

GAG GAG CGG CTG CGC TTC GCC ACT ACG CGC CAC CTG CGG CCC GAC ATG 
Glu Glu Arg Leu Arg Phe Ala Thr Thr Arg His Leu Arg Pro Asp Met 
795 800 805 

AAG ACG CAC CTG GAC CGG CCG CTG GTG GTG GAG CTG GGC CGC GAC GGC 
Lys Thr His Leu Asp Arg Pro Leu Val Val Glu Leu Gly Arg Asp Gly 
810 815 820 825 

GCG CGG GGG CCC GTG GGA GGC AAA GCC CGA CCT GAG GCT GCG GAG GCC 
Ala Arg Gly Pro Val Gly Gly Lys Ala Arg Pro Glu Ala Ala Glu Ala 
830 835 840 

CCC GAG GGC GTC GAC CCT CCG CGC AGG CAC CAC CGG CAC CGC GAC AAG 
Pro Glu Gly Val Asp Pro Pro Arg Arg His His Arg His Arg Asp Lys 
845 850 855 

GAC AAG ACC CCC GCG GCG GGG GAC CAG GAC CGA GCA GAG GCC CCG AAG 
Asp Lys Thr Pro Ala Ala Gly Asp Gin Asp Arg Ala Glu Ala Pro Lys 
860 865 870 

GCG GAG AGC GGG GAG CCC GGT GCC CGG GAG GAG CGG CCG CGG CCG CAC 
Ala Glu Ser Gly Glu Pro Gly Ala Arg Glu Glu Arg Pro Arg Pro His 
875 880 885 

CGC AGC CAC AGC AAG GAG GCC GCG GGG CCC CCG GAG GCG CGG AGC GAG 
Arg Ser His Ser Lys Glu Ala Ala Gly Pro Pro Glu Ala Arg Ser Glu 
890 895 900 905 

CGC GGC CGA GGC CCA GGC CCC GAG GGC GGC CGG CGG CAC CAC CGG CGC 
Arg Gly Arg Gly Pro Gly Pro Glu Gly Gly Arg Arg His His Arg Arg 
910 915 920 

GGC TCC CCG GAG GAG GCG GCC GAG CGG GAG CCC CGA CGC CAC CGC GCG 
Gly Ser Pro Glu Glu Ala Ala Glu Arg Glu Pro Arg Arg His Arg Ala 
925 930 935 
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CAC CGG CAC CAG GAT CCG AGO AAG GAG TGC GCC GGC GCC AAG GGC GAG 
His Arg His Gin Asp Pro Ser Lys Glu Cys Ala Gly Ala Lys Gly Glu 
940 945 950 

CGG CGC GCG CGG CAC CGC GGC GGC CCC CGA GCG GGG CCC CGG GAG GCG 
Arg Arg Ala Arg His Arg Gly Gly Pro Arg Ala Gly Pro Arg Glu Ala 
955 960 965 

GAG AGC GGG GAG GAG CCG GCG CGG CGG CAC CGG GCC CGG CAC AAG GCG 
Glu Ser Gly Glu Glu Pro Ala Arg Arg His Arg Ala Arg His Lys Ala 
970 975 980 9B5 

CAG CCT GCT CAC GAG GCT GTG GAG AAG GAG ACC ACG GAG AAG GAG GCC 
Gin Pro Ala His Glu Ala Val Glu Lys Glu Thr Thr Glu Lys Glu Ala 
990 995 1000 

ACG GAG AAG GAG GCT GAG ATA GTG GAA GCC GAC AAG GAA AAG GAG CTC 
Thr Glu Lys Glu Ala Glu He Val Glu Ala Asp Lys Glu Lys Glu Leu 
1005 1010 1015 

CGG AAC CAC CAG CCC CGG GAG CCA CAC TGT GAC CTG GAG ACC AGT GGG 
Arg Asn His Gin Pro Arg Glu Pro His cys Asp Leu Glu Thr Ser Gly 
1020 1025 1030 

ACT GTG ACT GTG GGT CCC ATG CAC ACA CTG CCC AGC ACC TGT CTC CAG 
Thr Val Thr Val Gly Pro Met His Thr Leu Pro Ser Thr Cys Leu Gin 
1035 1040 1045 

AAG GTG GAG GAA CAG CCA GAG GAT GCA GAC AAT CAG CGG AAC GTC ACT 
Lys val Glu Glu Gin Pro Glu Asp Ala Asp Asn Gin Arg Asn Val Thr 
1050 1055 1060 " 1065 

CGC ATG GGC AGT CAG CCC CCA GAC CCG AAC ACT ATT GTA CAT ATC CCA 
Arg Met Gly Ser Gin Pro Pro Asp Pro Asn Thr He Val His He Pro 
1070 1075 10B0 

GTG ATG CTG ACG GGC CCT CTT GGG GAA GCC ACG GTC GTT CCC AGT GGT 
Val Met Leu Thr Gly Pro Leu Gly Glu Ala Thr Val Val Pro Ser Gly 
1085 1090 1095 

AAC GTG GAC CTG GAA AGC CAA GCA GAG GGG AAG AAG GAG GTG GAA GCG 
Asn Val Asp Leu Glu Ser Gin Ala Glu Gly Lys Lys Glu Val Glu Ala 
1100 lios mo 

GAT GAC GTG ATG AGG AGC GGC CCC CGG CCT ATC GTC CCA TAC AGC TCC 
Asp Asp Val Met Arg Ser Gly Pro Arg Pro lie Val Pro Tyr Ser Ser 
1115 1120 1125 

ATG TTC TGT TTA AGC CCC ACC AAC CTG CTC CGC CGC TTC TGC CAC TAC 
Met Phe Cys Leu Ser Pro Thr Asn Leu Leu Arg Arg Phe Cys His Tyr 
H30 H35 1140 ' 1145 

ATC GTG ACC ATG AGG TAC TTC GAG GTG GTC ATT CTC GTG GTC ATC GCC 
He Val Thr Met Arg Tyr Phe Glu Val Val He Leu Val Val He Ala 
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TTG AGC AGC ATC GCC CTG GCT GCT GAG GAC CCA GTG CGC ACA GAC TCG 3674 
Leu Ser Ser lie Ala Leu Ala Ala Glu Asp Pro Val Arg Thr Asp Ser 
1165 1170 1175 

CCC AGG AAC AAC GCT CTG AAA TAC CTG GAT TAC ATT TTC ACT GGT GTC 3 722 

Pro Arg Asn Asn Ala Leu Lys Tyr Leu Asp Tyr lie Phe Thr Gly Val 
1180 1185 1190 

TTT ACC TTT GAG ATG GTG ATA AAG ATG ATC GAC TTG GGA CTG CTG CTT 3770 
Phe Thr Phe Glu Met Val lie Lys Met lie Asp Leu Gly Leu Leu Leu 
1195 1200 1205 

CAC CCT GGA GCC TAT TTC CGG GAC TTG TGG AAC ATT CTG GAC TTC ATT 3818 
His Pro Gly Ala Tyr Phe Arg Asp Leu Trp Asn lie Leu Asp Phe He 
1210 1215 1220 1225 

GTG GTC AGT GGC GCC CTG GTG GCG TTT GCT TTC TCA GGA TCC AAA GGG 3B66 
Val Val Ser Gly Ala Leu Val Ala Phe Ala Phe Ser Gly Ser Lys Gly 
1230 1235 1240 

AAA GAC ATC AAT ACC ATC AAG TCT CTG AGA GTC CTT CGT GTC CTG CGG 3914 
Lys Asp He Asn Thr He Lys Ser Leu Arg Val Leu Arg Val Leu Arg 
1245 1250 1255 

CCC CTC AAG ACC ATC AAA CGG CTG CCC AAG CTC AAG GCT GTG TTT GAC 3962 
Pro Leu Lys Thr He Lys Arg Leu Pro Lys Leu Lys Ala Val Phe Asp 
1260 1265 1270 

TGT GTG GTG AAC TCC CTG AAG AAT GTC CTC AAC ATC TTG ATT GTC TAC 4 010 

Cys Val Val Asn Ser Leu Lys Asn Val Leu Asn He Leu He Val Tyr 
1275 1280 1285 

ATG CTC TTC ATG TTC ATA TTT GCC GTC ATT GCG GTG CAG CTC TTC AAA 4 058 

Met Leu Phe Met Phe He Phe Ala Val He Ala Val Gin Leu Phe Lys 
1290 1295 1300 1305 

GGG AAG TTT TTC TAC TGC ACA GAT GAA TCC AAG GAG CTG GAG AGG GAC 4106 
Glv Lys Phe Phe Tyr Cys Thr Asp Glu Ser Lys Glu Leu Glu Arg Asp 
1310 1315 1320 

TGC AGG GGT CAG TAT TTG GAT TAT GAG AAG GAG GAA GTG GAA GCT CAG 4154 
Cys Arg Gly Gin Tyr Leu Asp Tyr Glu Lys Glu Glu Val Glu Ala Gin 
1325 1330 1335 

CCC AGG CAG TGG AAG AAA TAC GAC TTT CAC TAC GAC AAT GTG CTC TGG 42 02 

Pro Arg Gin Trp Lys Lys Tyr Asp Phe His Tyr Asp Asn Val Leu Trp 
1340 1345 1350 

GCT CTG CTG ACG CTG TTC ACA GTG TCC ACG GGA GAA GGC TGG CCC ATG 4250 
Ala Leu Leu Thr Leu Phe Thr Val Ser Thr Gly Glu Gly Trp Pro Met 
1355 1360 1365 

GTG CTG AAA CAC TCC GTG GAT GCC ACC TAT GAG GAG CAG GGT CCA AGC 4298 
Val Leu Lys His Ser Val Asp Ala Thr Tyr Glu Glu Gin Gly Pro Ser 
1370 1375 1380 1385 
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CCT GGG TAC CGC ATG GAG CTG TCC ATC TTC TAC GTG GTC TAC TTT GTG 4346 
Pro Gly Tyr Arg Met Glu Leu Ser He Phe Tyr Val Val Tyr Phe Val 
1390 1395 1400 

GTC TTT CCC TTC TTC TTC GTC AAC ATC TTT GTG GCT TTG ATC ATC ATC 4394 
Val Phe Pro Phe Phe Phe Val Asn He Phe Val Ala Leu He He He 
1405 1410 1415 

ACC TTC CAG GAG CAG GGG GAC AAG GTG ATG TCT GAA TGC AGC CTG GAG 4442 
Thr Phe Gin Glu Gin Gly Asp Lys Val Met Ser Glu Cys Ser Leu Glu 
1420 1425 1430 

AAG AAC GAG AGG GCT TGC ATT GAC TTC GCC ATC AGC GCC AAA CCC CTG 4490 
Lys Asn Glu Arg Ala Cys He Asp Phe Ala He Ser Ala Lys Pro Leu 
1435 1440 1445 

ACA CGG TAC ATG CCC CAA AAC CGG CAG TCG TTC CAG TAT AAG ACG TGG 4538 
Thr Arg Tyr Met Pro Gin Asn Arg Gin Ser Phe Gin Tyr Lys Thr Trp 
1450 1455 1460 1465 

ACA TTT GTG GTC TCC CCG CCC TTT GAA TAC TTC ATC ATG GCC ATG ATA 4586 
Thr Phe Val Val Ser Pro Pro Phe Glu Tyr Phe He Met Ala Met He 
1470 1475 1480 

GCC CTC AAC ACT GTG GTG CTG ATG ATG AAG TTC TAT GAT GCA CCC TAT 4634 
Ala Leu Asn Thr Val Val Leu Met Met Lys Phe Tyr Asp Ala Pro Tyr 
1485 1490 1495 

GAG TAC GAG CTG ATG CTG AAA TGC CTG AAC ATC GTG TTC ACA TCC ATG 4682 
Glu Tyr Glu Leu Met Leu Lys Cys Leu Asn He Val Phe Thr Ser Met 
1500 1505 1510 

TTC TCC ATG GAA TGC GTG CTG AAG ATC ATC GCC TTT GGG GTG CTG AAC 4730 
Phe Ser Met Glu Cys Val Leu Lys He He Ala Phe Gly Val Leu Asn 
1515 1520 1525 

TAT TTC AGA GAT GCC TGG AAT GTC TTT GAC TTT GTC ACT GTG TTG GGA 4778 
Tyr Phe Arg Asp Ala Trp Asn Val Phe Asp Phe Val Thr Val Leu Gly 
1530 1535 1540 1545 

AGT ATT ACT GAT ATT TTA GTA ACA GAG ATT GCG GAA ACG AAC AAT TTC 4 826 

Ser He Thr Asp He Leu Val Thr Glu He Ala Glu Thr Asn Asn Phe 
1550 1555 1560 

ATC AAC CTC AGC TTC CTC CGC CTC TTT CGA GCT GCG CGG CTG ATC AAG 4874 
He Asn Leu Ser Phe Leu Arg Leu Phe Arg Ala Ala Arg Leu He Lys 
1565 1570 1575 

CTG CTC CGC CAG GGC TAC ACC ATC CGC ATC CTG CTG TGG ACC TTT GTC 4922 
Leu Leu Arg Gin Gly Tyr Thr He Arg He Leu Leu Trp Thr Phe Val 
1580 1585 1590 

CAG TCC TTC AAG GCC CTG CCC TAC GTG TGT CTG CTC ATT GCC ATG CTG 4970 
Gin Ser Phe Lys Ala Leu Pro Tyr Val Cys Leu Leu He Ala Met Leu 
1595 1600 1605 
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TTC TTC ATC TAC GCC ATC ATC GGC ATG CAG GTG TTT GGG AAT ATT GCC 
Phe Phe lie Tyr Ala He He Gly Met Gin Val Phe Gly Asn He Ala 
1610 1615 1620 1625 

CTG GAT GAT GAC ACC AGC ATC AAC CGC CAC ARC AAC TTC CGG ACG TTT 
Leu Asp Asp Asp Thr Ser He Asn Arg His Asn Asn Phe Arg Thr Phe 
1630 1635 1640 

TTG CAA GCC CTG ATG CTG CTG TTC AGG AGC GCC ACG GGG GAG GCC TGG 
Leu Gin Ala Leu Met Leu Leu Phe Arg Ser Ala Thr Gly Glu Ala Trp 
1645 1650 1655 

CAC GAG ATC ATG CTG TCC TGC CTG AGC AAC CAG GCC TGT GAT GAG CAG 
His Glu He Met Leu Ser Cys Leu Ser Asn Gin Ala Cys Asp Glu Gin 
1660 1665 1670 

GCC AAT GCC ACC GAG TGT GGA AGT GAC TTT GCC TAC TTC TAC TTC GTC 
Ala Asn Ala Thr Glu Cys Gly Ser Asp Phe Ala Tyr Phe Tyr Phe Val 
1675 1680 1685 

TCC TTC ATC TTC CTG TGC TCC TTT CTG ATG TTG AAC CTC TTT GTG GCT 
Ser Phe He Phe Leu Cys Ser Phe Leu Met Leu Asn Leu Phe Val Ala 
1690 1695 1700 1705 

GTG ATC ATG GAC AAT TTT GAG TAC CTC ACG CGG GAC TCT TCC ATC CTA 
Val He Met Asp Asn Phe Glu Tyr Leu Thr Arg Asp Ser Ser lie Leu 
1710 1715 1720 

RGT CCT CAC CAC TTG GAT GAG TTC ATC CGG GTC TGG GCT GAA TAC GAC 
G?y p£o His His Leu Asp Glu Phe lie Arg Val Trp Ala Glu Tyr Asp 
1725 1 7 3° 1735 



CCG GCT GCG TGT 
Pro Ala Ala Cys 
1740 

AAA CAC ATG TCC 
Lys His Met Ser 
1755 

GTT GCT TAC AAG 
Val Ala Tyr Lys 
1770 

GAC ATG ACT GTT 
Asp Met Thr Val 



GGG CGC ATC AGT TAC AAT GAC ATG TTT GAG ATG CTG 
Glv Arg He Ser Tyr Asn Asp Met Phe Glu Met Leu 
1745 1750 

CCG CCT CTG GGG CTG GGG AAG AAA TGC CCT GCT CGA 
Pro Pro Leu Gly Leu Gly Lys Lys Cys Pro Ala Arg 
1760 1765 

CGC CTG GTT CGC ATG AAC ATG CCC ATC TCC AAC GAG 
Arg Leu Val Arg Met Asn Met Pro He Ser Asn Glu 
1775 1780 1785 

CAC TTC ACG TCC ACG CTG ATG GCC CTC ATC CGG ACG 

1800 



PPA CTG GAG ATC AAG CTG GCC CCA GCT GGG ACA AAG CAG CAT CAG TGT 
til Su Su He S! Leu Ala Pro Ala Gly Thr Lys Gin His Gin Cys 
1805 1810 1815 

£ s s; s s t% as s IS ?S 3 S£ ss s 

1820 1825 1830 
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SS ft* ^ ^ AC TTG CTG GTA CCA CCC ^ AAG CCT GAT GAG ATG 

Gin Lys Thr Leu Asp Leu Leu Val Pro Pro His Lys Pro Asp Glu 
OJO 1840 1845 

*£?s s? is s s; a si s a s 2? s g; 

185S I860 2865 

CAG AAC AAA ACC ACC AGA GAC CAG ATG CAG CAG GCT CCT GGA GGC err 
Gin Asn Lys Thr Thr Arg Asp Gin Met Gin gT? Xla P?o G?J SS 
1870 1875 18 J 0 

Ser Gin mI? r?* £° T °. TG TCC CTG TTC ^ CCT CTG AAG GCC ACC CTG 
Ser Gin Met Gly Pro Val Ser Leu Phe His Pro Leu Lys Ala Thr Leu 
1885 1890 legs 

ri?, ^ G 5*° CCG GCT GTG CTC CGA GGA GCC CGG GTT TTC CTT CGA 

Glu Gin Thr Gin Pro Ala Val Leu Arg Gly Ala Arg Val Phe EeJ 
0 I 905 19X0 

Sin £5S c GT I CC S CC TCC CTC AGC ^ GGC GGG GCC ATA CAA AAC CAA 
Gin Lys Ser Ser Thr Ser Leu Ser Asn Gly Gly Ala He Gin Asn Sin 
1915 1920 1925 

GAG AGT GGC ATC AAA GAG TCT GTC TCC TGG GGC ACT CAA AGG ACC CAG 
Glu Ser Gly He Lys Glu Ser Val Ser Trp Gly Thr gj £ Sfn 

9 u 1 93 5 1940 1945 

GAT GCA CCC CAT GAG GCC AGG CCA CCC CTG GAG CGT GGC CAC TCC ACA 
Asp Ala Pro His Glu Ala Arg Pro Pro Leu Glu Arg Gly His Ser Thr" 
1950 1955 i960 

GAG ATC CCT GTG GGG CGG TCA GGA GCA CTG GCT GTG GAC GTT CAG ATG 
Glu He Pro Val Gly Arg Ser Gly Ala Leu Ala Val Asp Val S£ Set 



CAG AGC ATA ACC CGG AGG GGC CCT GAT GGG GAG CCC CAG CCT GGG CTG 
Gin Ser lie Thr Arg Arg Gly Pro Asp Gly Glu Pro Gin Pro Gly Leu 
1980 1985 1990 

GAG AGC CAG GGT CGA GCG GCC TCC ATG CCC CGC CTT GCG GCC GAG ACT 
Glu Ser Gin Gly Arg Ala Ala Ser Met Pro Arg Leu Ala Ala Glu Thr 
1995 2000 2005 

CAG CCC GTC ACA GAT GCC AGC CCC ATG AAG CGC TCC ATC TCC ACG CTG 
?;?„ Pr ° Val Thr Asp Ser Pro Met Lys Arg Ser He Ser Thr Leu 

2 °1° 2015 2020 2025 

GCC CAG CGG CCC CGT GGG ACT CAT CTT TGC AGC ACC ACC CCG GAC CGC 
Ala Gin Arg Pro Arg Gly Thr His Leu Cys Ser Thr Thr Pro Asp Arq 
2030 2035 2040 



6314 



WO 95/04822 



PCT/US94/09230 



-159- 



CGC CGC AGG GAC AGG AAG CAG AGG TCC CTG GAG AAG GGG CCC AGC CTG 
Arg Arg Arg Asp Arg Lys Gin Arg Ser Leu Glu Lys Sly Pro Ser Leu 
2060 2065 2070 

TCT GCC GAT ATG GAT GGC GCA CCA AGC AGT GCT GTG GGG CCG GGG CTG 
Ser Ala Asp Met Asp Gly Ala Pro Ser Ser Ala Val Gly Pro Gly Leu 
2075 2080 2085 

CCC CCG GGA GAG GGG CCT ACA GGC TGC CGG CGG GAA CGA GAG CGC CGG 
Pro Pro Gly Glu Gly Pro Thr Gly Cys Arg Arg Glu Arg Glu Arg Arg 
2090 2095 2100 2105 

CAG GAG CGG GGC CGG TCC CAG GAG CGG AGG CAG CCC TCA TCC TCC TCC 
Gin Glu Arg Gly Arg Ser Gin Glu Arg Arg Gin Pro Ser Ser Ser Ser 
2110 2115 2120 

TCG GAG AAG CAG CGC TTC TAC TCC TGC GAC CGC TTT GGG GGC CGT GAG 
Ser Glu Lys Gin Arg Phe Tyr Ser Cys Asp Arg Phe Gly Gly Arg Glu 
2125 2130 213 

CCC CCG AAG CCC AAG CCC TCC CTC AGC AGC CAC CCA ACG TCG CCA ACA 
Pro Pro Lys Pro Lys Pro Ser Leu Ser Ser His Pro Thr Ser Pro Tftr 
2140 2145 2150 

GCT GGC CAG GAG CCG GGA CCC CAC CCA CAG GCC GGC TCA GCC GTG GGC 
Ala Gly Gin Glu Pro Gly Pro His Pro Gin Ala Gly Ser Ala Val Gly 
2155 2160 2165 

TTT CCG AAC ACA ACG CCC TGC TGC AGA GAG ACC CCC TCA GCC AGC CCC 
Phe Pro Asn Thr Thr Pro Cys Cys Arg Glu Thr Pro Ser Ala Ser Pro^ 
2170 2175 



2180 



TGG CCC CTG GCT CTC GAA TTG GCT CTG ACC CTT ACC TGG GGC AGC GTC 
Trp Pro Leu Ala Leu Glu Leu Ala Leu Thr Leu Thr Trp Gly Ser Val 
2190 2195 2200 

TGG ACA GTG AGG CCT CTG TCC ACG CCC TGC CTG AGG ACA CGC TCA CTT 
ES ?£ Sal A^g Pro Leu Ser Thr Pro Cys Leu Arg Thr Arg S.r Leu 



Ser 



S SS 5 ™ » SiS «■ » s„ Gl * L " pr ° 



2220 

ACG TGT CCT CCC TGACCTCCCA GTCTCACCCT CTCCGCCGCG TGCCCAACGG 
Thr Cys Pro Pro 
2235 

TTACCACTGC ACCCTGGGAC TCAGCTCGGG TGGCCGAGCA CGGCACAGCT ACCACCACCC 
TGACCAAGAC CACTGGTGCT AGCTGCACCG TGACCGCTCA GACGCCTGCA TGCAGCAGGC 
GTGTGTTCCA GTGGATGAGT TTTATCATCC ACACGGGG C A GTCGGCCCTC GGGGGAGGCC 
TTGCCCACCT TGGTGAGGCT CCTGTGGCCC CTCCCTCCCC CTCCTCCCCT CTTTTACTCT 



6954 
7014 
7074 
7134 
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AGACGACGAA TAAAGCCCTG TTGCTTGAGT GTACGTACCG C 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1546 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1437 

(ix) FEATURE: 

(A) NAME/KEY: 3'UTR 

(B) LOCATION: 1435.. 1546 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

JS? vlf f* 8 A S C AGC ATG TCC CGG GGC C CT TAC CCA CCC TCC CAG 

Met Val Gin Lys Thr Ser Met Ser Arg Gly Pro Tyr pS So Hr Sn 
5 10 15 

n AG , t?" C S CC ATG GAG GTC ^ ^ C C CC AGC CCG CAG GGC AAA TAC AGC 
Glu lie Pro Met Glu Val Phe Asp Pro Ser Pro Gin G?y ^ ™ ™ 
25 30 

iSS iS 2?° CGA ^ *** CGG TCA GAT GGG ^C ACG TCC TCG GAT 

Lys Arg Lys Gly Arg Phe Lys Arg Ser Asp Gly Ser Thr Ser Ser Asp 
■ s:> 40 45 " 

£k C TCC AAC AGC TTT G TC CGC CAG GGC TCA GCG GAG TCC TAC ACC 

Thr Thr Ser Asn Ser Phe Val Arg Gin Gly Ser Ala ifu Ser Jyf 

AGC CGT CCA TCA GAC TCT GAT GTA TCT CTG GAG GAG GAC CGG GAR. GCP 
Ser Arg Pro Ser Asp Ser Asp Val Ser Leu Glu gIu Hp 2g JfJ A?a 



TTA AGG AAG GAA GCA GAG CGC CAG GCA TTA GCG CAG CTC GAG AAG GCC 
Leu Arg Lys Glu Ala Glu Arg Gin Ala Leu Ala Gin Leu Glu JJ! Sa 
85 so 95 

f5f ^ CC ^ G CCA GTG GCA TTT GCT GTG CGG ACA AAT GTT GGC TAC AAT 
Lys Thr Lys Pro Val Ala Phe Ala Val Arg Thr Asn Val Gly Tyr As^ 
100 105 no 

CCG TCT CCA GGG GAT GAG GTG CCT GTG CAG GGA GTG GCC ATC ACC TTC 
Pro ser Pro Gly Asp Glu Val Pro Val Gin Gly Val Ha lit ihr Se 
115 120 125 
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GAG CCC AAA GAC TTC CTG CAC ATC AAG GAG AAA TAC AAT AAT GAC TGG 
Glu Pro Lys Asp Phe Leu His lie Lys Glu Lys Tyr Asn Asn Asp Trp 
130 135 140 

TGG ATC GGG CGG CTG GTG AAG GAG GGC TGT GAG GTT GGC TTC ATT CCC 
Trp lie Gly Arg Leu Val Lys Glu Gly Cys Glu Val Gly Phe He Pro 
145 150 155 16° 

AGC CCC GTC AAA CTG GAC AGC CTT CGC CTG CTG CAG GAA CAG AAG CTG 
Ser Pro Val Lys Leu Asp Ser Leu Arg Leu Leu Gin Glu Gin Lys Leu 
165 l" 70 175 

CGC CAG AAC CGC CTC GGC TCC AGC AAA TCA GGC GAT AAC TCC AGT TCC 
Arg Gin Asn Arg Leu Gly Ser Ser Lys Ser Gly Asp Asn Ser Ser Ser 

180 I 85 190 

AGT CTG GGA GAT GTG GTG ACT GGC ACC CGC CGC CCC ACA CCC CCT GCC 
Ser Leu Gly Asp Val Val Thr Gly Thr Arg Arg Pro Thr Pro Pro Ala 
195 200 205 

AGT GCC AAA CAG AAG CAG AAG TCG ACA GAG CAT GTG CCC CCC TAT GAC 
Ser Ala Lys Gin Lys Gin Lys Ser Thr Glu His Val Pro Pro Tyr Asp 
210 215 220 

GTG GTG CCT TCC ATG AGG CCC ATC ATC CTG GTG GGA CCG TCG CTC AAG 
Val Val Pro Ser Met Arg Pro He He Leu Val Gly Pro Ser Leu Lys 
225 230 235 240 

GGC TAC GAG GTT ACA GAC ATG ATG CAG AAA GCT TTA TTT GAC TTC TTG 
Glv Tyr Glu Val Thr Asp Met Met Gin Lys Ala Leu Phe Asp Phe Leu 
245 250 255 

AAG CAT CGG TTT GAT GGC AGG ATC TCC ATC ACT CGT GTG ACG GCA GAT 
JJs Arg Phe Asp Gly Arg He Ser He Thr Arg Val Thr Ala Asp 

ATT TCC CTG GCT AAG CGC TCA GTT CTC AAC AAC CCC AGC AAA CAC ATC 
He Ser Leu Ala Lys Arg Ser Val Leu Asn Asn Pro Ser Lys His He 

275 280 
ATC ATT GAG CGC TCC AAC ACA CGC TCC AGC CTG GCT GAG GTG CAG AGT 
lie He Glu Arg Ser Asn Thr Arg Ser Ser Leu Ala Glu Val Gin Ser 
290 295 300 

SS S SS 5 S S SS S S 5 S£ S S5 SS S 

305 31° 315 

S 25 E 2S S S JS S S S S S !S 1 = 

325 330 

S S S S S 55 & K S S 5 S S S 

340 345 
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CAA AGG CTC ATC AAG TCC CGA GGA AAG TCT CAG TCC AAA CAC CTC AAT 
Gin Arg Leu lie Lys Ser Arg Gly Lys Ser Gin Ser Lys His Leu Asn 
355 360 365 

GTC CAA ATA GCG GCC TCG GAA AAG CTG GCA CAG TGC CCC CCT GAA ATG 
Val Gin lie Ala Ala Ser Glu Lys Leu Ala Gin Cys Pro Pro Glu Met 
370 375 380 

TTT GAC ATC ATC CTG GAT GAG AAC CAA TTG GAG GAT GCC TGC GAG CAT 
Phe Asp He He Leu Asp Glu Asn Gin Leu Glu Asp Ala Cys Glu His 
385 390 395 400 

CTG GCG GAG TAC TTG GAA GCC TAT TGG AAG GCC ACA CAC CCG CCC AGC 
Leu Ala Glu Tyr Leu Glu Ala Tyr Trp Lys Ala Thr His Pro Pro Ser 
405 410 415 

AGC ACG CCA CCC AAT CCG CTG CTG AAC CGC ACC ATG GCT ACC GCA GCC 
Ser Thr Pro Pro Asn Pro Leu Leu Asn Arg Thr Met Ala Thr Ala Ala 
420 425 430 

CTG GCT GCC AGC CCT GCC CCT GTC TCC AAC CTC CAG GTA CAG GTG CTC 
Leu Ala Ala Ser Pro Ala Pro Val Ser Asn Leu Gin Val Gin Val Leu 
435 440 445 

ACC TCG CTC AGG AGA AAC CTC GGC TTC TGG GGC GGG CTG GAG TCC TCA 
Thr Ser Leu Arg Arg Asn Leu Gly Phe Trp Gly Gly Leu Glu Ser Ser 
450 455 460 

CAG CGG GGC AGT GTG GTG CCC CAG GAG CAG GAA CAT GCC ATG TAGTGGGCGC 
Gin Arg Gly Ser Val Val Pro Gin Glu Gin Glu His Ala Met 
465 470 475 

CCTGCCCGTC TTCCCTCCTG CTCTGGGGTC GGAACTGGAG TGCAGGGAAC ATGGAGGAGG 
AAGGGAAGAG CTTTATTTTG TAAAAAAATA AGATGAGCGG CA 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1851 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1797 

(D) OTHER INFORMATION: /standard 

(ix) FEATURE: 

(A) NAME/KEY: 3'UTR 

(B) LOCATION: 1795.. 1851 



name= "Betal-3" 



1504 
1545 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
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ATG GTC CAG AAG ACC AGC ATG TCC CGG GGC CCT TAC CCA CCC TCC CAG 
Met Val Gin Lys Thr Ser Met Ser Arg Gly Pro Tyr Pro Pro Ser Gin 
1 5 10 15 

GAG ATC CCC ATG GGA GTC TTC GAC CCC AGC CCG CAG GGC AAA TAC AGC 
Glu lie Pro Met Gly Val Phe Asp Pro Ser Pro Gin Gly Lys Tyr Ser 
20 25 30 

AAG AGG AAA GGG CGA TTC AAA CGG TCA GAT GGG AGC ACG TCC TCG GAT 
Lys Arg Lys Gly Arg Phe Lys Arg Ser Asp Gly Ser Thr Ser Ser Asp 
35 40 45 

ACC ACA TCC AAC AGC TTT GTC CGC CAG GGC TCA GCG GAG TCC TAC ACC 
Thr Thr Ser Asn Ser Phe Val Arg Gin Gly Ser Ala Glu Ser Tyr Thr 
50 55 60 

AGC CGT CCA TCA GAC TCT GAT GTA TCT CTG GAG GAG GAC CGG GAA GCC 
Ser Arg Pro Ser Asp Ser Asp Val Ser Leu Glu Glu Asp Arg Glu Ala 
65 70 75 80 

TTA AGG AAG GAA GCA GAG CGC CAG GCA TTA GCG CAG CTC GAG AAG GCC 
Leu Arg Lys Glu Ala Glu Arg Gin Ala Leu Ala Gin Leu Glu Lys Ala 
85 90 95 

AAG ACC AAG CCA GTG GCA TTT GCT GTG CGG ACA AAT GTT GGC TAC AAT 
Lvs Thr Lys Pro Val Ala Phe Ala Val Arg Thr Asn Val Gly Tyr Asn 
100 105 HO 

CCG TCT CCA GGG GAT GAG GTG CCT GTG CAG GGA GTG GCC ATC ACC TTC 
Pro Ser Pro Gly Asp Glu Val Pro Val Gin Gly Val Ala He Thr Phe 
115 120 125 

GAG CCC AAA GAC TTC CTG CAC ATC AAG GAG AAA TAC AAT AAT GAC TGG 
Glu Pro Lys Asp Phe Leu His He Lys Glu Lys Tyr Asn Asn Asp Tip 
130 135 140 

TGG ATC GGG CGG CTG GTG AAG GAG GGC TGT GAG GTT GGC TTC ATT CCC 
Trp He Gly Arg Leu Val Lys Glu Gly Cys Glu Val Gly Phe He Pro 
145 15° 155 

AGC CCC GTC AAA CTG GAC AGC CTT CGC CTG CTG CAG GAA CAG AAG CTG 
Ser Pro Val Lys Leu Asp Ser Leu Arg Leu Leu Gin Glu Gin Lys Leu 
165 I 70 175 

CGC CAG AAC CGC CTC GGC TCC AGC AAA TCA GGC GAT AAC TCC AGT TCC 
Sg Sn isn Arg Leu Gly Ser Ser Lys Ser Gly Asp Asn Ser Ser Ser 
180 185 1»0 

AGT CTG GGA GAT GTG GTG ACT GGC ACC CGC CGC CCC ACA CCC CCT GCC 
Ser Leu Gly Asp Val Val Thr Gly Thr Arg Arg Pro Thr Pro Pro Ala 
195 200 205 

AGT GCC AAA CAG AAG CAG AAG TCG ACA GAG CAT GTG CCC CCC TAT GAC 
stl Sa Lys Gin Lys Gin Lys Ser Thr Glu His Val Pro Pro Tyr Asp 
210 215 220 
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GTG GTG CCT TCC ATG AGG CCC ATC ATC CTG GTG GGA CCG TCG CTC AAG 
Val val Pro Ser Met Arg Pro He He Leu Val Gly Pro Ser Leu Lys 
225 230 235 240 

GGC TAC GAG GTT ACA GAC ATG ATG CAG AAA GCT TTA TTT GAC TTC TTG 
Gly Tyr- Glu Val Thr Asp Met Met Gin Lys Ala Leu Phe Asp Phe Leu 
245 250 255 

AAG CAT CGG TTT GAT GGC AGG ATC TCC ATC ACT CGT GTG ACG GCA GAT 
Lys His Arg Phe Asp Gly Arg He Ser He Thr Arg Val Thr Ala Asp 
260 265 270 

ATT TCC CTG GCT AAG CGC TCA GTT CTC AAC AAC CCC AGC AAA CAC ATC 
He Ser Leu Ala Lys Arg Ser Val Leu Asn Asn Pro Ser Lys His He 
275 280 285 

ATC ATT GAG CGC TCC AAC ACA CGC TCC AGC CTG GCT GAG GTG CAG AGT 
He He Glu Arg Ser Asn Thr Arg Ser Ser Leu Ala Glu Val Gin Ser 
290 295 300 

GAA ATC GAG CGA ATC TTC GAG CTG GCC CGG ACC CTT CAG TTG GTC GCT 
Glu He Glu Arg He Phe Glu Leu Ala Arg Thr Leu Gin Leu val Ala 
305 310 315 320 

CTG GAT GCT GAC ACC ATC AAT CAC CCA GCC CAG CTG TCC AAG ACC TCG 
Leu Asp Ala Asp Thr He Asn His Pro Ala Gin Leu Ser Lys Thr Ser 
325 330 335 

CTG GCC CCC ATC ATT GTT TAC ATC AAG ATC ACC TCT CCC AAG GTA CTT 
Leu Ala Pro He He Val Tyr He Lys He Thr Ser Pro Lys Val Leu 
340 345 350 

CAA AGG CTC ATC AAG TCC CGA GGA AAG TCT CAG TCC AAA CAC CTC AAT 
Gin Arg Leu He Lys Ser Arg Gly Lys Ser Gin Ser Lys His Leu Asn 
355 360 365 

GTC CAA ATA GCG GCC TCG GAA AAG CTG GCA CAG TGC CCC CCT GAA ATG 
Val Gin He Ala Ala Ser Glu Lys Leu Ala Gin Cys Pro Pro Glu Met 
370 375 380 

TTT GAC ATC ATC CTG GAT GAG AAC CAA TTG GAG GAT GCC TGC GAG CAT 
Phe Asp He He Leu Asp Glu Asn Gin Leu Glu Asp Ala Cys Glu His 
385 390 395 " 400 

CTG GCG GAG TAC TTG GAA GCC TAT TGG AAG GCC ACA CAC CCG CCC AGC 
Leu Ala Glu Tyr Leu Glu Ala Tyr Trp Lys Ala Thr His Pro Pro Ser 
405 410 415 

AGC ACG CCA CCC AAT CCG CTG CTG AAC CGC ACC ATG GCT ACC GCA GCC 
Ser Thr Pro Pro Asn Pro Leu Leu Asn Arg Thr Met Ala Thr Ala Ala 
420 425 430 

CTG GCT GCC AGC CCT GCC CCT GTC TCC AAC CTC CAG GGA CCC TAC CTT 
Leu Ala Ala Ser Pro Ala Pro Val Ser Asn Leu Gin Gly Pro Tyr Leu 
435 440 445 
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GCT TCC GGG GAC GAG CCA CTG GAA CGG GCC ACC GGG GAG CAC GCC AGC 
Ala Ser Gly Asp Gin Pro Leu Glu Arg Ala Thr Gly Glu His Ala Ser 
450 455 460 

ATG CAC GAG TAC CCA GGG GAG CTG GGC CAG CCC CCA GGC CTT TAC CCC 
Met His Glu Tyr Pro Gly Glu Leu Gly Gin Pro Pro Gly Leu Tyr Pro 
465 470 475 480 

AGC AGC CAC CCA CCA GGC CGG GCA GGC ACG CTA CGG GCA CTG TCC CGC 
Ser Ser His Pro Pro Gly Arg Ala Gly Thr Leu Arg Ala Leu Ser Arg 
485 490 495 

CAA GAC ACT TTT GAT GCC GAC ACC CCC GGC AGC CGA AAC TCT GCC TAC 
Gin Asp Thr Phe Asp Ala Asp Thr Pro Gly Ser Arg Asn Ser Ala Tyr 
500 505 510 

ACG GAG CTG GGA GAC TCA TGT GTG GAC ATG GAG ACT GAC CCC TCA GAG 
Thr Glu Leu Gly Asp Ser Cys Val Asp Met Glu Thr Asp Pro Ser Glu 
515 520 525 

GGG CCA GGG CTT GGA GAC CCT GCA GGG GGC GGC ACG CCC CCA GCC CGA 
Gly Pro Gly Leu Gly Asp Pro Ala Gly Gly Gly Thr Pro Pro Ala Arg 
530 535 540 

CAG GGA TCC TGG GAG GAC GAG GAA GAA GAC TAT GAG GAA GAG CTG ACC 
Gin Gly Ser Trp Glu Asp Glu Glu Glu Asp Tyr Glu Glu Glu Leu Thr 
545 550 555 560 

GAC AAC CGG AAC CGG GGC CGG AAT AAG GCC CGC TAC TGC GCT GAG GGT 
Asp Asn Arg Asn Arg Gly Arg Asn Lys Ala Arg Tyr Cys Ala Glu Gly 
565 570 575 

GGG GGT CCA GTT TTG GGG CGC AAC AAG AAT GAG CTG GAG GGC TGG GGA 
Glv Glv Pro Val Leu Gly Arg Asn Lys Asn Glu Leu Glu Gly Trp Gly 
580 585 590 

CGA GGC GTC TAC ATT CGC TGAGAGGCAG GGGCCACACG GCGGGAGGAA 
Arg Gly Val Tyr He Arg 
595 

GGGCTCTGAG CCCAGGGGAG GGGAGGG 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3600 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 35.. 3310 
(D) OTHER INFORMATION: 



/standard_name= "Alpha-2" 
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(ix) FEATURE: 

(A) NAME /KEY: 5'UTR 

(B) LOCATION: 1..34 

(ix) FEATURE: 

(A) NAME/KEY: 3'UTR 

(B) LOCATION: 3308.. 3600 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

GCGGGGGAGG GGGCATTGAT CTTCGATCGC GAAG ATG GCT GCT GGC TGC CTG 

Met Ala Ala Gly Cys Leu 
1 5 
CTG GCC TTG ACT CTG ACA CTT TTC CAA TCT TTG CTC ATC GGC CCC TCG 
Leu Ala Leu Thr Leu Thr Leu Phe Gin Ser Leu Leu lie Gly Pro Ser 
10 is 20 

TCG GAG GAG CCG TTC CCT TCG GCC GTC ACT ATC AAA TCA TGG GTG GAT 
Ser Glu Glu Pro Phe Pro Ser Ala Val Thr He Lys Ser Trp Val Asp 
25 30 35 

AAG ATG CAA GAA GAC CTT GTC ACA CTG GCA AAA ACA GCA AGT GGA GTC 
Lys Met Gin Glu Asp Leu Val Thr Leu Ala Lys Thr Ala Ser Gly Val 
40 45 50 

AAT CAG CTT GTT GAT ATT TAT GAG AAA TAT CAA GAT TTG TAT ACT GTG 
Asn Gin Leu Val Asp He Tyr Glu Lys Tyr Gin Asp Leu Tyr Thr Val 
55 60 65 70 

GAA CCA AAT AAT GCA CGC CAG CTG GTA GAA ATT GCA GCC AGG GAT ATT 
Glu Pro Asn Asn Ala Arg Gin Leu Val Glu He Ala Ala Arg Asp He 
75 80 85 

GAG AAA CTT CTG AGC AAC AGA TCT AAA GCC CTG GTG AGC CTG GCA TTG 
Glu Lys Leu Leu Ser Asn Arg Ser Lys Ala Leu Val Ser Leu Ala Leu 
90 95 ioo 

GAA GCG GAG AAA GTT CAA GCA GCT CAC CAG TGG AGA GAA GAT TTT GCA 
Glu Ala Glu Lys Val Gin Ala Ala His Gin Trp Arg Glu Asp Phe Ala 
105 no us 

AGC AAT GAA GTT GTC TAC TAC AAT GCA AAG GAT GAT CTC GAT CCT GAG 
Ser Asn Glu Val Val Tyr Tyr Asn Ala Lys Asp Asp Leu Asp Pro Glu 
120 125 130 

AAA AAT GAC AGT GAG CCA GGC AGC CAG AGG ATA AAA CCT GTT TTC ATT 
Lys Asn Asp Ser Glu Pro Gly Ser Gin Arg He Lys Pro Val Phe He 
135 140 145 150 

GAA GAT GCT AAT TTT GGA CGA CAA ATA TCT TAT CAG CAC GCA GCA GTC 
Glu Asp Ala Asn Phe Gly Arg Gin He Ser Tyr Gin His Ala Ala Val 
155 160 165 

CAT ATT CCT ACT GAC ATC TAT GAG GGC TCA ACA ATT GTG TTA AAT GAA 
His He Pro Thr Asp He Tyr Glu Gly Ser Thr He Val Leu Asn Glu 



BNSDOCID: <WO 9504822A1_I_> 



WO 95/04822 



PCT/US94/09230 



-167- 



CTC AAC TGG ACA AGT GCC TTA GAT GAA GTT TTC AAA AAG AAT CGC GAG 
llu Asn Trp Thr Ser Ala Leu Asp Glu Val Phe Lys Lys Asn Arg Glu 

185 190 
GAA GAC CCT TCA TTA TTG TGG CAG GTT TTT GGC AGT GCC ACT GGC CTA 
Glu Pro Ser Leu Leu Trp Glu Val Phe Gly Ser Ala Thr Gly Leu 

200 205 210 

GCT CGA TAT TAT CCA GCT TCA CCA TGG GTT GAT AAT AGT AGA ACT CCA 
Ala Arg Tyr Tyr Pro Ala Ser Pro Trp Val Asp Asn Ser Arg Thr Pro 

AAT AAG ATT GAC CTT TAT GAT GTA CGC AGA AGA CCA TGG TAC ATC CAA 
£s"n He Sip Leu Tyr Asp Val Arg Arg Arg Pro Trp Tyr lie Gin 

235 240 

GGA GCT GCA TCT CCT AAA GAC ATG CTT ATT CTG GTG GAT GTG AGT GGA 
Gly Ala Ala Ser Pro Lys Asp Met Leu lie Leu Val Asp Val Ser Gly 
250 255 260 

AGT GTT AGT GGA TTG ACA CTT AAA CTG ATC CGA ACA TCT GTC TCC GAA 
Ser Val Ser Gly Leu Thr Leu Lys Leu lie Arg Thr Ser Val Ser giu 
265 270 275 

ATG TTA GAA ACC CTC TCA GAT GAT GAT TTC GTG AAT GTA GCT TCA TTT 
nit llu Glu Thr Leu Ser Asp Asp Asp Phe Val Asn Val Ala Ser Phe 

280 285 290 

AAC AGC AAT GCT CAG GAT GTA AGC TGT TTT CAG CAC CTT GTC CAA GCA 

sir iS til Gin Asp Val Ser Cys Phe Gin His Leu Val Gin Ala 
295 300 305 



Gil Liu Leu Asn Tyr Asn Val Ser Arg Ala Asn Cys Asn Lys He He 



CAG CTG CTT AAT TAT AAT GTT TCC AGA GCA AAC TGC AAT AAG ATT ATT 

,,-1 ggj. i 

345 350 
ATG CTA TTC ACG GAT GGA GGA GAA GAG 
365 



« — AGA GCC CAG GAG ATA TTT AAC 

nel Leu Phe Thr Asp Gly Gly Glu Glu Arg Ala Gin Glu He Phe Asn 

360 365 
m TAC AAT AAA OAT AAA AAA « «, ? TA TTC » TTT TCA CTT £T 

390 

CAA OC AAT TAT ?A0 » CCA CCT ATT CAC TCC ATC GCC TCT CAA AAC 



£ ™ « ^ 5 E ^ ™ M9 val s: Ms phe ser 

k GGA CCT ATT CAG TGG ATG GCC TGT 
Gin His Asn Tyr Glu Arg Gly Pro He Gin Trp Met Ala Cys 
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AAA COT 'TAT TAT TAT GAA ATT CCT TCC ATT GGT GCA ATA AGA ATC AAT 
Lys Gly Tyr Tyr Tyr Glu lie Pro Ser He Gly Ala l£ J£ He JS 
415 420 

Thr Gin ^ T ^ ^ GTT GGA AGA CCA A TG GTT TTA GCA GGA 

Thr Gin Glu Tyr Leu Asp Val Leu Gly Arg Pro Met Val Leu Ala Gly 
430 435 

^ GTC ^ TGG ACA ^T GTG TAC CTG GAT GCA TTG 
Asp Lys Ala Lys Gin Val Gin Trp Thr Asn Val Tyr Leu Asp Ala Leu 

GAA CTG GGA CTT GTC ATT ACT GGA ACT CTT CCG GTC TTC AAC ATA ACC 
Glu Leu Gly Leu Val lie Thr Gly Thr Leu Pro Val p£e A^n i£ 

460 465 4 70 

G?v SSUrf T ^ ACA AAC TTA AAG AAC CAG CTG ATT CTT GGT 
Gly Gin Phe Glu Asn Lys Thr Asn Leu Lys Asn Gin Leu He Leu Gly 
475 480 4 85 

vlt S2 Si v T ? I CT T TTG GAA GAT ATT AAA AGA CTG ACA CCA 

vai Met Gly val Asp Val Ser Leu Glu Asp He Lys Arg Leu Thr Pro 
490 495 500 

CGT TTT ACA CTG TGC CCC AAT GGG TAT TAC TTT GCA ATC GAT CCT AAT 
Arg Phe Thr Leu Cys Pro Asn Gly Tyr Tyr Se S ?E Pro 
510 52,5 

GGT TAT GTT TTA TTA CAT CCA AAT CTT CAG CCA AAG AAC CCC AAA TCT 
Gly Tyr Val Leu Leu His Pro Asn Leu Gin PrS £y"s J£ Pro L^ sS 
"° 525 530 

CAG GAG CCA GTA ACA TTG GAT TTC CTT GAT GCA GAG TTA GAG AAT GAT 
Gin Glu Pro Val Thr Leu Asp Phe Leu Asp aS SxS S gJu 

540 545 S50 

ATT AAA GTG GAG ATT CGA AAT AAG ATG ATT GAT GGG GAA AGT GGA GAA 
He Lys val Glu lie Arg Asn Lys Met lie Asp Gly G?5 HI g?J gJS 



555 ~ 560 



565 



AAA ACA TTC AGA ACT CTG GTT AAA TCT CAA GAT GAG AGA TAT ATT GAC 
Lys Thr Phe Arg Thr Leu Val Lys Ser Gin Asp Glu Jrg i£ J£ Sp 
570 575 580 

OS AGG ACA TAC ACA TGG ACA CCT GTC AAT GGC ACA GAT TAC 

Lys Gly Asn Arg Thr Tyr Thr Trp Thr Pro Val Asn Gly Thr Asp Tyr 
585 590 595 

stl Jlf ? ° C P G GTA TTA CCA ACC TAC AGT TTT TAC TAT ATA AAA GCC 
3 L6U Val LSU Pr ° Thr Ser phe ^ r ^ "e Lys Ala 

600 605 610 

S5i fI A ^ GAG ACA ATA ACT ^ GCC AGA TCA AAA AAG GGC AAA ATG 
Lys Leu Glu Glu Thr He Thr Gin Ala Arg Ser Lys Lys Gly Lys Met 
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615 620 625 630 

AAG GAT TCG GAA ACC CTG AAG CCA GAT AAT TTT GAA GAA TCT GGC TAT 1972 
Lys Asp Ser Glu Thr Leu Lys Pro Asp Asn Phe Glu Glu Ser Gly Tyr 
635 640 645 

ACA TTC ATA GCA CCA AGA GAT TAC TGC AAT GAC CTG AAA ATA TCG GAT 2020 
Thr Phe lie Ala Pro Arg Asp Tyr Cys Asn Asp Leu Lys He Ser Asp 
650 655 660 

AAT AAC ACT GAA TTT CTT TTA AAT TTC AAC GAG TTT ATT GAT AGA AAA 2068 
Asn Asn Thr Glu Phe Leu Leu Asn Phe Asn Glu Phe He Asp Arg Lys 
665 670 675 

ACT CCA AAC AAC CCA TCA TGT AAC GCG GAT TTG ATT AAT AGA GTC TTG 2116 
Thr Pro Asn Asn Pro Ser Cys Asn Ala Asp Leu He Asn Arg Val Leu 
680 685 690 

CTT GAT GCA GGC TTT ACA AAT GAA CTT GTC CAA AAT TAC TGG AGT AAG 2164 
Leu Asp Ala Gly Phe Thr Asn Glu Leu Val Gin Asn Tyr Trp Ser Lys 
695 700. 705 710 

CAG AAA AAT ATC AAG GGA GTG AAA GCA CGA TTT GTT GTG ACT GAT GGT 2212 
Gin Lys Asn He Lys Gly Val Lys Ala Arg Phe Val Val Thr Asp Gly 
715 720 725 

GGG ATT ACC AGA GTT TAT CCC AAA GAG GCT GGA GAA AAT TGG CAA GAA 2260 
Gly He Thr Arg Val Tyr Pro Lys Glu Ala Gly Glu Asn Trp Gin Glu 
730 735 740 

AAC CCA GAG ACA TAT GAG GAC AGC TTC TAT AAA AGG AGC CTA GAT AAT 2 3 OB 

Asn Pro Glu Thr Tyr Glu Asp Ser Phe Tyr Lys Arg Ser Leu Asp Asn 
745 * 750 755 

GAT AAC TAT GTT TTC ACT GCT CCC TAC TTT AAC AAA AGT GGA CCT GGT 23 56 

Asp Asn Tyr Val Phe Thr Ala Pro Tyr Phe Asn Lys Ser Gly Pro Gly 
760 765 770 

GCC TAT GAA TCG GGC ATT ATG GTA AGC AAA GCT GTA GAA ATA TAT ATT 24 04 

Ala Tyr Glu Ser Gly He Met Val Ser Lys Ala Val Glu He Tyr He 
775 780 785 790 

CAA GGG AAA CTT CTT AAA CCT GCA GTT GTT GGA ATT AAA ATT GAT GTA 2452 
Gin Gly Lys Leu Leu Lys Pro Ala Val Val Gly He Lys He Asp Val 
795 800 805 

AAT TCC TGG ATA GAG AAT TTC ACC AAA ACC TCA ATC AGA GAT CCG TGT 2500 
Asn Ser Trp He Glu Asn Phe Thr Lys Thr Ser He Arg Asp Pro Cys 
810 815 820 

GCT GGT CCA GTT TGT GAC TGC AAA AGA AAC AGT GAC GTA ATG GAT TGT 2548 
Ala Gly Pro Val Cys Asp Cys Lys Arg Asn Ser Asp Val Met Asp Cys 
825 830 835 

GTG ATT CTG GAT GAT GGT GGG TTT CTT CTG ATG GCA AAT CAT GAT GAT 2596 
Val He Leu Asp Asp Gly Gly Phe Leu Leu Met Ala Asn His Asp Asp 
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840 



845 



850 



f S i£ SS s S £5 s s s ss s s s s s 

860 865 870 

S S 2S S J 5 JS S E SS £ K S i£ £ E ^ 

88 0 885 

S g s s « - - « « s s ™ » g* « s 

895 900 

m s s S 22 £ s s s s s ss S 2 Si s 
ss i s s s s s s s s s s £ a s s 

925 g 30 

2 S 2 S S S S 2 S SS 22 52 SS IS? SS £ 

945 950 
GAT GAC TTC ACG GCC TCC CTG TCC AAG CAG AGC TGC ATT ACT GAa n&& 
Asp Asp Phe Thr Ala Ser Leu Ser Lys Gin Ser Jys J™ J£ SI G^ 
960 965 

S SS £ S 2 !S £S SS S 5S S SS sSr SS s s 

975 980 

GAC TGT GGA AAC TGT TCC AGA ATC TTT CAT GGA GAA AAG CTT ATC aar 
Asp Cys Oly Asn Cys Ser Arg He Phe His Gly" gJS £s S Se? A^n 
985 990 995 

Jhr 7™ t? A ^ ATA ATG GTT GAG AGC AAA GGG ACA TGT CCA TGT 

Thr Asn Leu He Phe He Met Val Glu Ser Lys oly Thr gl pS J^s 
1005 1010 

GAC ACA CGA CTG CTC ATA CAA GCG GAG CAG ACT TCT GAC GGT CCA AAT 
Asp Thr Arg Leu Leu lie Gin Ala Glu Gin Thr Ser Isp SlJ J£ 
1020 1025 10 30 

Pro Ml. ^ CCT AGA TAC CGA *** GGG CCT GA * GTC 

Pro Cys Asp Met Val Lys Gin Pro Arg Tyr Arg Lys Gly Pro Asp Val 
10 35 1040 10 45 

III HI £5 ^ T TTG GAG GAT TAT ACT GAC TG * GGT GGT GTT 

cys Phe Asp Asn Asn Val Leu Glu Asp Tyr Thr Asp Cys Gly Gly Val 
1050 1055 1060 

S3 T TTA ** T CCC TCC CTG TGG TAT ATC ATT GGA ATC CAG TTT CTA 

Ser Gly Leu Asn Pro Ser Leu Trp Tyr lie He Gly He Gin . Phe Leu 
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CTA CTT TGG CTG GTA TCT GGC AGC ACA CAC CGG CTG TTA TGACCTTCTA 3317 
Leu Leu Trp Leu Val Ser Gly Ser Thr His Arg Leu Leu 
1080 1085 1090 

AAAACCAAAT CTGCATAGTT AAACTCCAGA CCCTGCCAAA ACATGAGCCC TGCCCTCAAT 3377 

TACAGTAACG TAGGGTCAGC TATAAAATCA GACAAACATT AGCTGGGCCT GTTCCATGGC 3437 

ATAACACTAA GGCGCAGACT CCTAAGGCAC CCACTGGCTG CATGTCAGGG TGTCAGATCC 3497 

TTAAACGTGT GTGAATGCTG CATCATCTAT GTGTAACATC AAAGCAAAAT CCTATACGTG 3 557 

TCCTCTATTG GAAAATTTGG GCGTTTGTTG TTGCATTGTT GGT 3600 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 323 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

CCCCCTGCCA GTGGCCAAAC AGAAGCAGAA GTCGGGTAAT GAAATGACTA ACTTAGCCTT 6 0 

TGAACTAGAC CCCCTAGAGT TAGAGGAGGA AGAGGCTGAG CTTGGTGAGC AGAGTGGCTC 120 

TGCCAAGACT AGTGTTAGCA GTGTCACCAC CCCGCCACCC CATGGCAAAC GCATCCCCTT 180 

CTTTAAGAAG ACAGAGCATG TGCCCCCCTA TGACGTGGTG CCTTCCATGA GGCCCATCAT 24 0 

CCTGGTGGGA CCGTCGCTCA AGGGCTACGA GGTTACAGAC ATGATGCAGA AAGCTTTATT 300 

TGACTTCTTG AAGCATCGGT TTG 323 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid. 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CCTATTGGTG TAGGTATACC AACAATTAAT TTAAGAAAAA GG AG AC C C AA TATCCAG 5 7 
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(2) INFORMATION FOR SEQ ID NO : 14 : 

(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 180 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 

(ix) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 
(B) LOCATION: 1..132 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO -14- 

2 s s s 2 s 2 s s 2 s 2 2 2 2 s 

S5 2 S 2 2 2 2 2 2 5 2 S 2 2 2 S 
2 2 2 2 S 2 S Kg 2 2 "a™"-™ 

35 40 
CGACCCTCAG GCTTCTTCCC AGGAAGCGGG G 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid- 
(A) DESCRIPTION: Oligonucleotide 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
AATTCGGTAC GTACACTCGA GC 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS - 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid • 

(A) DESCRIPTION: Oligonucleotide 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 
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GCTCGAGTGT ACGTACCG 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid; 

(A) DESCRIPTION: Oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

CCATGGTACC TTCGTTGACG 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid; 

(A) DESCRIPTION: Oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18: 

AATTCGTCAA CGAAGGTACC ATGG 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2153 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

<ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 53 .. 1504 

(D) OTHER INFORMATION: /standard_name= 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CCTCGGA CCCCCTGTCC CGGGGGAGGG GGAGAGCCCG 

TCT TTT TCT GAC TCC AGT GCA ACC TTC CTG CTG AAC GAG GGT TCA GCC 
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Ser Phe Ser Asp Ser Ser Ala Thr Phe Leu 



Leu Asn Glu Gly Ser Ala 



GAC TCC TAC ACC AGC CGC CCA TCT CTG GAC TCA GAC GTC TCC CTG GAr 
Asp ser Tyr Thr Ser Arg Pro Ser Leu Asp ser As£ Si Ser Zlu 2S 
20 25 30 

aiu aJ» ^ G c GT GCC CGG CGT GAA GTA GAG cag gct cag cag 
Glu Asp Arg Glu Ser Ala Arg Arg Glu Val Glu Ser Gin Ala Gin Gin 
40 45 

I C I C AGG GCC CAC *** CCT GTG GCA TTT GCG GTG AGG ACC 

Gin Leu Glu Arg Ala Lye His Lys Pro Val Ala Phe Ala Val Arg Thr 
55 eo g5 

OS* vll s2r Tvr S5 rf° S"? CTG GAT GAG GAG TGC CCA GTC CAG GGC 
Asn Val ser Tyr Cys Gly Val Leu Asp Glu Glu Cys Pro Val Gin Gly 

70 75 eo 

TCT GGA GTC AAC TTT GAG GCC AAA GAT TTT CTG CAC ATT AAA GAG AAG 
Ser Gly Val Asn Phe Glu Ala Lys Asp Phe Leu His S L^ 52 Jyf 
85 90 95 

Ser ^ ^ I GG ATC GGG CGG CTA GTG AAA GAG GGC GGG GAC 

Tyr ser Asn Asp Trp Trp He Gly Arg Leu Val Lys Glu Gly Gly Asp 

He A?° P^ C t^ C CCC AGC CCC CAG CGC CTG «" AGC ATC CGG CTC AAA 
He Ala Phe He Pro Ser Pro Gin Arg Leu Glu Ser He Arg Leu Lys 
115 120 !2S 

SS ctn w G GCC AGG AGA TCT GGG **= CCT TCC AQ C CTG AGT GAC 
Gin Glu Gin Lys Ala Arg Arg Ser Gly Asn Pro Ser Ser Leu Ser Asp 
■ LJ ° 13 5 140 14 5 

Til AAC CGA CGC TCC CCT CCG CCA TCT CTA GCC AAG CAG AAG CAA 

He Gly Asn Arg Arg Ser Pro Pro Pro Ser Leu Ala Lys Gin Lys Gin 
150 155 i e o 

^ GCG ^ GT ? CCC CCG TAT GAC CTG GTG CCC TCC ATG CGG 

Lys Gin Ala Glu His Val Pro Pro Tyr Asp Val Val Pro Ser Met Arg 
16 5 170 175 

Pro Val Hi Zl G v T ? ^? A ° CC TCT CTG GGT TAT GAG CTC ACA GAC 

Pro Val Val Leu Val Gly Pro Ser Leu Lys Gly Tyr Glu Val Thr Asp 
180 185 igo 

£2 mI? ^ f* 0 GCT CTC TTC GAC 1X0 CTC ^ AGA TTT GAT GGC 

Met Met Gin Lys Ala Leu Phe Asp Phe Leu Lys His Arg Phe Asp Gly 
195 200 205 

AGG ATC TCC ATC ACC CGA GTC ACA GCC GAC CTC TCC CTG GCA AAG CGA 
Arg He Ser lie Thr Arg Val Thr Ala Asp Leu Ser Leu Ala £y1! 

215 220 225 

TCT GTG CTC AAC AAT CCG GGC AAG AGG ACC ATC ATT GAG CGC TCC TCT 
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Ser Val Leu Asn Asn Pro Gly Lys Arg Thr lie lie Glu Arg Ser Ser 
230 235 240 

GCC CGC TCC AGC ATT GCG GAA GTG CAG AGT GAG ATC GAG CGC ATA TTT 823 
Ala Arg Ser Ser lie Ala Glu Val Gin Ser Glu lie Glu Arg lie Phe 
245 250 255 

GAG CTG GCC AAA TCC CTG CAG CTA GTA GTG TTG GAC GCT GAC ACC ATC 871 
Glu Leu Ala Lys Ser Leu Gin Leu Val Val Leu Asp Ala Asp Thr lie 
260 265 270 

AAC CAC CCA GCA CAG CTG GCC AAG ACC TCG CTG GCC CCC ATC ATC GTC 919 
Asn His Pro Ala Gin Leu Ala Lys Thr Ser Leu Ala Pro He He Val 
275 280 2B5 

TTT GTC AAA GTG TCC TCA CCA AAG GTA CTC CAG CGT CTC ATT CGC TCC 967 
Phe Val Lys Val Ser Ser Pro Lys Val Leu Gin Arg Leu He Arg Ser 
290 295 300 305 

CGG GGG AAG TCA CAG ATG AAG CAC CTG ACC GTA CAG ATG ATG GCA TAT 1015 
Arg Gly Lys Ser Gin Met Lys His Leu Thr Val Gin Met Met Ala Tyr 
310 315 320 

GAT AAG CTG GTT CAG TGC CCA CCG GAG TCA TTT GAT GTG ATT CTG GAT 1063 
Asp Lys Leu Val Gin Cys Pro Pro Glu Ser Phe Asp Val He Leu Asp 
325 330 335 

GAG AAC CAG CTG GAG GAT GCC TGT GAG CAC CTG GCT GAG TAC CTG GAG 1111 
Glu Asn Gin Leu Glu Asp Ala Cys Glu His Leu Ala Glu Tyr Leu Glu 
340 345 350 

GTT TAC TGG CGG GCC ACG CAC CAC CCA GCC CCT GGC CCC GGA CTT CTG 115 9 

Val Tyr Trp Arg Ala Thr His His Pro Ala Pro Gly Pro Gly Leu Leu 
355 360 365 

GGT CCT CCC AGT GCC ATC CCC GGA CTT CAG AAC CAG CAG CTG CTG GGG 1207 
Gly Pro Pro Ser Ala He Pro Gly Leu Gin Asn Gin Gin Leu Leu Gly 
370 375 380 385 

GAG CGT GGC GAG GAG CAC TCC CCC CTT GAG CGG GAC AGC TTG ATG CCC 1255 
Glu Arg Gly Glu Glu His Ser Pro Leu Glu Arg Asp Ser Leu Met Pro 
390 395 400 

TCT GAT GAG GCC AGC GAG AGC TCC CGC CAA GCC TGG ACA GGA TCT TCA 13 03 

Ser Asp Glu Ala Ser Glu Ser Ser Arg Gin Ala Trp Thr Gly Ser Ser 
405 410 415 

CAG CGT AGC TCC CGC CAC CTG GAG GAG GAC TAT GCA GAT GCC TAC CAG 13 51 

Gin Arg Ser Ser Arg His Leu Glu Glu Asp Tyr Ala Asp Ala Tyr Gin 
420 425 430 

GAC CTG TAC CAG CCT CAC CGC CAA CAC ACC TCG GGG CTG CCT AGT GCT 13 99 

Asp Leu Tyr Gin Pro His Arg Gin His Thr Ser Gly Leu Pro Ser Ala 
435 440 445 

AAC GGG CAT GAC CCC CAA GAC CGG CTT CTA GCC CAG GAC TCA GAA CAC 144 7 
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Asn Gly His Asp Pro Gin Asp Arg Leu Leu Ala Gin Asp Ser Glu His 
455 460 



465 



Sin Ss ^ f GG ^ C TGG ^ CGC AAC CGG CCT TGG CCC AAG GAT 

Asn His Ser Asp Arg Asn Trp Gin Arg Asn Arg Pro Trp Pro Lys Asp 



480 

ter Tyr TGA CAG C CTCCTGCTGC CCTACCCTGG CAGGCACAGG 1543 
CGCAGCTGGC TGGGGGGCCC ACTCCAGGCA GGGTGGCGTT AGACTGGCAT 
CAGGCTGGCA CTAGGCTCAG CCCCCAAAAC CCCCTGCCCA GCCCCAGCTT CAGGGCTGCC 
TGTGGTCCCA AGGTTCTGGG AGAAACAGGG GACCCCCTCA CCTCCTGGGC AGTGACCCCT 
ACTAGGCTCC CATTCCAGGT ACTAGCTGTG TGTTCTGCAC CCCTGGCACC TTCCTCTCCT 
CCCACACAGG AAGCTGCCCC ACTGGGCAGT GCCCTCAGGC CAGGATCCCC TTAGCAGGGT 
CCTTCCCACC AGACTCAGGG AAGGGATGCC CCATTAAAGT GACAAAAGGG TGGGTGTGGG 
CACCATGGCA TGAGGAAGAA ACAAGGTCCC TGAGCAGGCA CAAGTCCTGA CAGTCAAGGG 
ACTGCTTTGG CATCCAGGGC CTCCAGTCAC CTCACTGCCA TACATTAGAA ATGAGACAAT 
TCAAAGCCCC CCCAGGGTGG CACACCCATC TGTTGCTGGG GTGTGGCAGC CACATCCAAG 
ACTGGAGCAG CAGGCTGGCC ACGCTTGGGC CAGAGAGAGC TCACAGCTGA AGCTCTTGGA 
GGGAAGGGCT CTCCTCACCC AATCG 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2144 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

<ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 51.. 1492 
calcium channel* INFORMATION: /product= "A Beta3 subunit of human 

(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
CGCCCCCGGC GCCGCTCGTT CCCCCGACCC GGACTCCCCC ATGTATGACG ACTCCTACGT 
GCCCGGGTTT GAGGACTCGG AGGCGGTTTC AGCCGACTCC TACACCAGCC GCCCATCTCT 



1593 
1648 
1708 
1768 
1828 
1888 
1948 
2008 
2068 
2128 
2153 
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GGACTCAGAC GTCTCCCTGG AGGAGGACCG GGAGAGTGCC CGGCGTGAAG TAGAGAGCCA 180 

GGCTCAGCAG CAGCTCGAAA GGGCCAAGCA CAAACCTGTG GCATTTGCGG TGAGGACCAA 24 0 

TGTCAGCTAC TGTGGCGTAC TGGATGAGGA GTGCCCAGTC CAGGGCTCTG GAGTCAACTT 3 00 

TGAGGCCAAA GATTTTCTGC ACATTAAAGA GAAGTACAGC AATGACTGGT GGATCGGGCG 360 

GCTAGTGAAA GAGGGCGGGG ACATCGCCTT CATCCCCAGC CCCCAGCGCC TGGAGAGCAT 420 

CCGGCTCAAA CAGGAGCAGA AGGCCAGGAG ATCTGGGAAC CCTTCCAGCC TGAGTGACAT 480 

TGGCAACCGA CGCTCCCCTC CGCCATCTCT AGCCAAGCAG AAGCAAAAGC AGGCGGAACA 540 

TGTTCCCCCG TATGACGTGG TGCCCTCCAT GCGGCCTGTG GTGCTGGTGG GACCCTCTCT 600 

GAAAGGTTAT GAGGTCACAG ACATGATGCA GAAGGCTCTC TTCGACTTCC TCAAACACAG 660 

ATTTGATGGC AGGATCTCCA TCACCCGAGT CACAGCCGAC CTCTCCCTGG CAAAGCGATC 720 

TGTGCTCAAC AATCCGGGCA AGAGGACCAT CATTGAGCGC TCCTCTGCCC GCTCCAGCAT 78 0 

TGCGGAAGTG CAGAGTGAGA TCGAGCGCAT ATTTGAGCTG GCCAAATCCC TGCAGCTAGT 84 0 

AGTGTTGGAC GCTGACACCA TCAACCACCC AGCACAGCTG GCCAAGACCT CGCTGGCCCC 900 

CATCATCGTC TTTGTCAAAG TGTCCTCACC AAAGGTACTC CAGCGTCTCA TTCGCTCCCG 960 

GGGGAAGTCA CAGATGAAGC ACCTGACCGT ACAGATGATG GCATATGATA AGCTGGTTCA 1020 

GTGCCCACCG GAGTCATTTG ATGTGATTCT GGATGAGAAC CAGCTGGAGG ATGCCTGTGA 108 0 

GCACCTGGCT GAGTACCTGG AGGTTTACTG GCGGGCCACG CACCACCCAG CCCCTGGCCC 114 0 

CGGACTTCTG GGTCCTCCCA GTGCCATCCC CGGACTTCAG AACCAGCAGC TGCTGGGGGA 12 00 

GCGTGGCGAG GAGCACTCCC CCCTTGAGCG GGACAGCTTG ATGCCCTCTG ATGAGGCCAG 1260 

CGAGAGCTCC CGCCAAGCCT GGACAGGATC TTCACAGCGT AGCTCCCGCC ACCTGGAGGA 1320 

GGACTATGCA GATGCCTACC AGGACCTGTA CCAGCCTCAC CGCCAACACA CCTCGGGGCT 1380 

GCCTAGTGCT AACGGGCATG ACCCCCAAGA CCGGCTTCTA GCCCAGGACT CAGAACACAA 1440 

CCACAGTGAC CGGAACTGGC AGCGCAA.CCG GCCTTGGCCC AAGGATAGCT ACTGACAGCC 15 00 

TCCTGCTGCC CTACCCTGGC AGGCACAGGC GCAGCTGGCT GGGGGGCCCA CTCCAGGCAG 156 0 

GGTGGCGTTA GACTGGCATC AGGCTGGCAC TAGGCTCAGC CCCCAAAACC CCCTGCCCAG 1620 

CCCCAGCTTC AGGGCTGCCT GTGGTCCCAA GGTTCTGGGA GAAACAGGGG ACCCCCTCAC 168 0 

CTCCTGGGCA GTGACCCCTA CTAGGCTCCC ATTCCAGGTA CTAGCTGTGT GTTCTGCACC 174 0 

CCTGGCACCT TCCTCTCCTC CCACACAGGA AGCTGCCCCA CTGGGCAGTG CCCTCAGGCC 1800 
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AGGATCCCCT TAGCAGGGTC CTTCCCACCA GACTCAGGGA AGGGATGCCC CATTAAAGTG 1860 

ACAAAAGGGT GGGTGTGGGC ACCATGGCAT GAGGAAGAAA CAAGGTCCCT GAGCAGGCAC 1920 

AAGTCCTGAC AGTCAAGGGA CTGCTTTGGC ATCCAGGGCC TCCAGTCACC TCACTGCCAT 1980 

ACATTAGAAA TGAGACAATT CAAAGCCCCC CCAGGGTGGC ACACCCATCT GTTGCTGGGG 2040 

TGTGGCAGCC ACATCCAAGA CTGGAGCAGC AGGCTGGCCA CGCTTGGGCC AGAGAGAGCT 2100 

CACAGCTGAA GCTCTTGGAG GGAAGGGCTC TCCTCACCCA ATCG 2144 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid; 

(A) DESCRIPTION: Oligonucleotide 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 

CTCAGTACCA TCTCTGATAC CAGCCCCA 28 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 7808 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 237.. 7769 

(D) OTHER INFORMATION: /standard_name= "Alpha-1A-1" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

GATGTCCCGA GCTGCTATCC CCGGCTCGGC CCGGGCAGCC GCCTTCTGAG CCCCCGACCC 60 

GAGGCGCCGA GCCGCCGCCG CCCGATGGGC TGGGCCGTGG AGCGTCTCCG CAGTCGTAGC 120 

TCCAGCCGCC GCGCTCCCAG CCCCGGCAGC CTCAGCATCA GCGGCGGCGG CGGCGGCGGC 180 

GGCGTCTTCC GCATCGTTCG CCGCAGCGTA ACCCGGAGCC CTTTGCTCTT TGCAGA 236 

ATG GCC CGC TTC GGA GAC GAG ATG CCG GCC CGC TAC GGG GGA GGA GGC 284 
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Met Ala Arg Phe Gly Asp Glu Met Pro Ala Arg Tyr Gly Gly Gly Gly 
15 10 15 

TCC GGG GCA GCC GCC GGG GTG GTC GTG GGC AGC GGA GGC GGG CGA GGA 332 
Ser Gly Ala Ala Ala Gly Val Val Val Gly Ser Gly Gly Gly Arg Gly 
20 25 30 

GCC GGG GGC AGC CGG CAG GGC GGG CAG CCC GGG GCG CAA AGG ATG TAC 380 
Ala Gly Gly Ser Arg Gin Gly Gly Gin Pro Gly Ala Gin Arg Met Tyr 
35 40 45 

AAG CAG TCA ATG GCG CAG AGA GCG CGG ACC ATG GCA CTC TAC AAC CCC 428 
Lys Gin Ser Met Ala Gin Arg Ala Arg Thr Met Ala Leu Tyr Asn Pro 
50 55 60 

ATC CCC GTC CGA CAG AAC TGC CTC ACG GTT AAC CGG TCT CTC TTC CTC 4 76 

lie Pro Val Arg Gin Asn Cys Leu Thr Val Asn Arg Ser Leu Phe Leu 
65 70 75 80 

TTC AGC GAA GAC AAC GTG GTG AGA AAA TAC GCC AAA AAG ATC ACC GAA 524 
Phe Ser Glu Asp Asn Val Val Arg Lys Tyr Ala Lys Lys He Thr Glu 
85 90 95 

TGG CCT CCC TTT GAA TAT ATG ATT TTA GCC ACC ATC ATA GCG AAT TGC 572 
Trp Pro Pro Phe Glu Tyr Met He Leu Ala Thr He He Ala Asn Cys 
100 105 110 

ATC GTC CTC GCA CTG GAG CAG CAT CTG CCT GAT GAT GAC AAG ACC CCG 62 0 

He Val Leu Ala Leu Glu Gin His Leu Pro Asp Asp Asp Lys Thr Pro 
115 120 125 

ATG TCT GAA CGG CTG GAT GAC ACA GAA CCA TAC TTC ATT GGA ATT TTT 668 
Met Ser Glu Arg Leu Asp Asp Thr Glu Pro Tyr Phe He Gly He Phe 
130 135 140 

TGT TTC GAG GCT GGA ATT AAA ATC ATT GCC CTT GGG TTT GCC TTC CAC 716 
Cys Phe Glu Ala Gly He Lys He He Ala Leu Gly Phe Ala Phe His 
145 150 155 160 

AAA GGC TCC TAC TTG AGG AAT GGC TGG AAT GTC ATG GAC TTT GTG GTG 764 
Lys Gly Ser Tyr Leu Arg Asn Gly Trp Asn Val Met Asp Phe Val Val 
165 170 175 

GTG CTA ACG GGC ATC TTG GCG ACA GTT GGG ACG GAG TTT GAC CTA CGG 812 
Val Leu Thr Gly He Leu Ala Thr Val Gly Thr Glu Phe Asp Leu Arg 
180 185 190 

ACG CTG AGG GCA GTT CGA GTG CTG CGG CCG CTC AAG CTG GTG TCT GGA 860 
Thr Leu Arg Ala Val Arg Val Leu Arg Pro Leu Lys Leu Val Ser Gly 
195 200 205 

ATC CCA AGT TTA CAA GTC GTC CTG AAG TCG ATC ATG AAG GCG ATG ATC 908 
He Pro Ser Leu Gin Val Val Leu Lys Ser He Met Lys Ala Met He 
210 215 220 

CCT TTG CTG CAG ATC GGC CTC CTC CTA TTT TTT GCA ATC CTT ATT TTT 956 
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Pro Leu Leu Gin lie Gly Leu Leu Leu Phe Phe Ala He Leu He Phe 
225 230 235 240 

GCA ATC ATA GGG TTA GAA TTT TAT ATG GGA AAA TTT CAT ACC ACC TGC 
Ala He He Gly Leu Glu Phe Tyr Met Gly Lys Phe His Thr Thr Cys 
245 250 255 

TTT GAA GAG GGG ACA GAT GAC ATT CAG GGT GAG TCT CCG GCT CCA TGT 
Phe Glu Glu Gly Thr Asp Asp He Gin Gly Glu Ser Pro Ala Pro Cys 
260 265 270 

GGG ACA GAA GAG CCC GCC CGC ACC TGC CCC AAT GGG ACC AAA TGT CAG 
Gly Thr Glu Glu Pro Ala Arg Thr Cys Pro Asn Gly Thr Lys Cys Gin 
275 280 285 

CCC TAC TGG GAA GGG CCC AAC AAC GGG ATC ACT CAG TTC GAC AAC ATC 
Pro Tyr Trp Glu Gly Pro Asn Asn Gly He Thr Gin Phe Asp Asn He 
290 295 300 

CTG TTT GCA GTG CTG ACT GTT TTC CAG TGC ATA ACC ATG GAA GGG TGG 
Leu Phe Ala Val Leu Thr Val Phe Gin Cys He Thr Met Glu Gly Trp 
305 310 315 320 

ACT GAT CTC CTC TAC AAT AGC AAC GAT GCC TCA GGG AAC ACT TGG AAC 
Thr Asp Leu Leu Tyr Asn Ser Asn Asp Ala Ser Gly Asn Thr Trp Asn 
325 330 335 

TGG TTG TAC TTC ATC CCC CTC ATC ATC ATC GGC TCC TTT TTT ATG CTG 
Trp Leu Tyr Phe He Pro Leu He He He Gly Ser Phe Phe Met Leu 
340 345 350 

AAC CTT GTG CTG GGT GTG CTG TCA GGG GAG TTT GCC AAA GAA AGG GAA 
Asn Leu Val Leu Gly Val Leu Ser Gly Glu Phe Ala Lys Glu Arg Glu 
355 360 365 

CGG GTG GAG AAC CGG CGG GCT TTT CTG AAG CTG AGG CGG CAA CAA CAG 
Arg Val Glu Asn Arg Arg Ala Phe Leu Lys Leu Arg Arg Gin Gin Gin 
370 375 380 

ATT GAA CGT GAG CTC AAT GGG TAC ATG GAA TGG ATC TCA AAA GCA GAA 
He Glu Arg Glu Leu Asn Gly Tyr Met Glu Trp He Ser Lys Ala Glu 
385 390 395 400 

GAG GTG ATC CTC GCC GAG GAT GAA ACT GAC GGG GAG CAG AGG CAT CCC 
Glu Val He Leu Ala Glu Asp Glu Thr Asp Gly Glu Gin Arg His Pro 
405 410 415 

TTT GAT GGA GCT CTG CGG AGA ACC ACC ATA AAG AAA AGC AAG ACA GAT 
Phe Asp Gly Ala Leu Arg Arg Thr Thr He Lys Lys Ser Lys Thr Asp 
420 425 430 

TTG CTC AAC CCC GAA GAG GCT GAG GAT CAG CTG GCT GAT ATA GCC TCT 
Leu Leu Asn Pro Glu Glu Ala Glu Asp Gin Leu Ala Asp He Ala Ser 
435 440 445 

GTG GGT TCT CCC TTC GCC CGA GCC AGC ATT AAA AGT GCC AAG CTG GAG 
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Val Gly Ser Pro Phe Ala Arg Ala Ser He Lys Ser Ala Lys Leu Glu 
450 455 460 

AAC TCG ACC TTT TTT CAC AAA AAG GAG AGG AGG ATG CGT TTC TAC ATC 1676 
Asn Ser Thr Phe Phe His Lys Lys Glu Arg Arg Met Arg Phe Tyr He 
465 470 475 480 

CGC CGC ATG GTC AAA ACT CAG GCC TTC TAC TGG ACT GTA CTC AGT TTG 1724 
Arg Arg Met Val Lys Thr Gin Ala Phe Tyr Trp Thr Val Leu Ser Leu 
485 490 495 

GTA GCT CTC AAC ACG CTG TGT GTT GCT ATT GTT CAC TAC AAC CAG CCC 1772 
Val Ala Leu Asn Thr Leu Cys Val Ala He Val His Tyr Asn Gin Pro 
500 505 510 

GAG TGG CTC TCC GAC TTC CTT TAC TAT GCA GAA TTC ATT TTC TTA GGA 1820 
Glu Trp Leu Ser Asp Phe Leu Tyr Tyr Ala Glu Phe He Phe Leu Gly 
515 520 525 

CTC TTT ATG TCC GAA ATG TTT ATA AAA ATG TAC GGG CTT GGG ACG CGG 1868 
Leu Phe Met Ser Glu Met Phe He Lys Met Tyr Gly Leu Gly Thr Arg 
530 535 540 

CCT TAC TTC CAC TCT TCC TTC AAC TGC TTT GAC TGT GGG GTT ATC ATT 1916 
Pro Tyr Phe His Ser Ser Phe Asn Cys Phe Asp Cys Gly Val He He 
545 550 555 560 

GGG AGC ATC TTC GAG GTC ATC TGG GCT GTC ATA AAA CCT GGC ACA TCC 1964 
Gly Ser He Phe Glu Val He Trp Ala Val He Lys Pro Gly Thr Ser 
565 570 575 

TTT GGA ATC AGC GTG TTA CGA GCC CTC AGG TTA TTG CGT ATT TTC AAA 2012 
Phe Gly He Ser Val Leu Arg Ala Leu Arg Leu Leu Arg He Phe Lys 
580 585 590 

GTC ACA AAG TAC TGG GCA TCT CTC AGA AAC CTG GTC GTC TCT CTC CTC 2060 
Val Thr Lys Tyr Trp Ala Ser Leu Arg Asn Leu Val Val Ser Leu Leu 
595 600 605 

AAC TCC ATG AAG TCC ATC ATC AGC CTG TTG TTT CTC CTT TTC CTG TTC 2108 
Asn Ser Met Lys Ser He He Ser Leu Leu Phe Leu Leu Phe Leu Phe 
610 615 620 

ATT GTC GTC TTC GCC CTT TTG GGA ATG CAA CTC TTC GGC GGC CAG TTT 2156 
He Val Val Phe Ala Leu Leu Gly Met Gin Leu Phe Gly Gly Gin Phe 
.625 630 635 640 

AAT TTC GAT GAA GGG ACT CCT CCC ACC AAC TTC GAT ACT TTT CCA GCA 2204 
Asn Phe Asp Glu Gly Thr Pro Pro Thr Asn Phe Asp Thr Phe Pro Ala 
645 650 655 

GCA ATA ATG ACG GTG TTT CAG ATC CTG ACG GGC GAA GAC TGG AAC GAG 2252 
Ala He Met Thr Val Phe Gin He Leu Thr Gly Glu Asp Trp Asn Glu 
660 665 670 

GTC ATG TAC GAC GGG ATC AAG TCT CAG GGG GGC GTG CAG GGC GGC ATG 23 00 
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Val Met Tyr Asp Gly He Lys Ser Gin Gly Gly Val Gin Gly Gly Met 
675 680 685 

GTG TTC TCC ATC TAT TTC ATT GTA CTG ACG CTC TTT GGG AAC TAC ACC 
Val Phe Ser He Tyr Phe lie Val Leu Thr Leu Phe Gly Asn Tyr Thr 
690 695 700 

CTC CTG AAT GTG TTC TTG GCC ATC GCT GTG GAC ART CTG GCC AAC GCC 
Leu Leu Asn Val Phe Leu Ala He Ala Val Asp Asn Leu Ala Asn Xla 
705 710 715 720 

CAG GAG CTC ACC AAG GTG GAG GCG GAC GAG CAA GAG GAA GAA GAA GCA 
Gin Glu Leu Thr Lys Val Glu Ala Asp Glu Gin Glu Glu Glu Glu Ala 
7 25 730 735 

GCG AAC CAG AAA CTT GCC CTA CAG AAA GCC AAG GAG GTG GCA GAA GTG 
Ala Asn Gin Lys Leu Ala Leu Gin Lys Ala Lys Glu Val Ala Glu Val 
740 745 ?50 

AGT CCT CTG TCC GCG GCC AAC ATG TCT ATA GCT GTG AAA GAG CAA CAG 
Ser Pro Leu Ser Ala Ala Asn Met Ser He Ala Val Lys Glu Gin Gin 
755 760 765 

AAG AAT CAA AAG CCA GCC AAG TCC GTG TGG GAG CAG CGG ACC AGT GAG 
Lys Asn Gin Lys Pro Ala Lys Ser Val Trp Glu Gin Arg Thr Ser Glu 
770 775 780 

ATG CGA AAG CAG AAC TTG CTG GCC AGC CGG GAG GCC CTG TAT AAC GAA 
Met Arg Lys Gin Asn Leu Leu Ala Ser Arg Glu Ala Leu Tyr Asn Glu 
785 790 795 ~ 800 

ATG GAC CCG GAC GAG CGC TGG AAG GCT GCC TAC ACG CGG CAC CTG CGG 
Met Asp Pro Asp Glu Arg Trp Lys Ala Ala Tyr Thr Arg His Leu Arg 
805 810 815 

CCA GAC ATG AAG ACG CAC TTG GAC CGG CCG CTG GTG GTG GAC CCG CAG 
Pro Asp Met Lys Thr His Leu Asp Arg Pro Leu Val Val Asp Pro Gin 
820 825 830 

GAG AAC CGC AAC AAC AAC ACC AAC AAG AGC CGG GCG GCC GAG CCC ACC 
Glu Asn Arg Asn Asn Asn Thr Asn Lys Ser Arg Ala Ala Glu Pro Thr 
835 840 845 

GTG GAC CAG CGC CTC GGC CAG CAG CGC GCC GAG GAC TTC CTC AGG AAA 
Val Asp Gin Arg Leu Gly Gin Gin Arg Ala Glu Asp Phe Leu Arg Lys 
850 855 860 

CAG GCC CGC TAC CAC GAT CGG GCC CGG GAC CCC AGC GGC TCG GCG GGC 
Gin Ala Arg Tyr His Asp Arg Ala Arg Asp Pro Ser Gly Ser Ala Gly 
865 870 875 880 

CTG GAC GCA CGG AGG CCC TGG GCG GGA AGC CAG GAG GCC GAG CTG AGC 
Leu Asp Ala Arg Arg Pro Trp Ala Gly Ser Gin Glu Ala Glu Leu Ser 
885 890 895 

CGG GAG GGA CCC TAC GGC CGC GAG TCG GAC CAC CAC GCC CGG GAG GGC 
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Arg Glu Gly Pro Tyr Gly Arg Glu Ser Asp His His Ala Arg Glu Gly 
900 905 910 

AGC CTG GAG CAA CCC GGG TTC TGG GAG GGC GAG GCC GAG CGA GGC AAG 
Ser Leu Glu Gin Pro Gly Phe Trp Glu Gly Glu Ala Glu Arg Gly Lys 
915 920 925 

GCC GGG GAC CCC CAC CGG AGG CAC GTG CAC CGG CAG GGG GGC AGC AGG 
Ala Gly Asp Pro His Arg Arg His Val His Arg Gin Gly Gly Ser Arg 
930 935 940 

GAG AGC CGC AGC GGG TCC CCG CGC ACG GGC GCG GAC GGG GAG CAT CGA 
Glu Ser Arg Ser Gly Ser Pro Arg Thr Gly Ala Asp Gly Glu His Arg 
945 950 955 960 

CGT CAT CGC GCG CAC CGC AGG CCC GGG GAG GAG GGT CCG GAG GAC AAG 
Arg His Arg Ala His Arg Arg Pro Gly Glu Glu Gly Pro Glu Asp Lys 
965 970 975 

GCG GAG CGG AGG GCG CGG CAC CGC GAG GGC AGC CGG CCG GCC CGG GGC 
Ala Glu Arg Arg Ala Arg His Arg Glu Gly Ser Arg Pro Ala Arg Gly 
980 985 990 

GGC GAG GGC GAG GGC GAG GGC CCC GAC GGG GGC GAG CGC AGG AGA AGG 
Gly Glu Gly Glu Gly Glu Gly Pro Asp Gly Gly Glu Arg Arg Arg Arg 
995 1000 1005 

CAC CGG CAT GGC GCT CCA GCC ACG TAC GAG GGG GAC GCG CGG AGG GAG 
His Arg His Gly Ala Pro Ala Thr Tyr Glu Gly Asp Ala Arg Arg Glu 
1010 1015 1020 

GAC AAG GAG CGG AGG CAT CGG AGG AGG AAA GAG AAC CAG GGC TCC GGG 
Asp Lys Glu Arg Arg His Arg Arg Arg Lys Glu Asn Gin Gly Ser Gly 
1025 1030 1035 1040 

GTC CCT GTG TCG GGC CCC AAC CTG TCA ACC ACC CGG CCA ATC CAG CAG 
Val Pro Val Ser Gly Pro Asn Leu Ser Thr Thr Arg Pro He Gin Gin 
1045 1050 1055 

GAC CTG GGC CGC CAA GAC CCA CCC CTG GCA GAG GAT ATT GAC AAC ATG 
Asp Leu Gly Arg Gin Asp Pro Pro Leu Ala Glu Asp He Asp Asn Met 
1060 1065 1070 

AAG AAC AAC AAG CTG GCC ACC GCG GAG TCG GCC GCT CCC CAC GGC AGC 
Lys Asn Asn Lys Leu Ala Thr Ala Glu Ser Ala Ala Pro His Gly Ser 
* 1075 1080 1085 

CTT GGC CAC GCC GGC CTG CCC CAG AGC CCA GCC AAG ATG GGA AAC AGC 
Leu Gly His Ala Gly Leu Pro Gin Ser Pro Ala Lys Met Gly Asn Ser 
1090 1095 HOO 

ACC GAC CCC GGC CCC ATG CTG GCC ATC CCT GCC ATG GCC ACC AAC CCC 
Thr Asp Pro Gly Pro Met Leu Ala He Pro Ala Met Ala Thr Asn Pro 
1105 1110 1H5 ll 20 

CAG AAC GCC GCC AGC CGC CGG ACG CCC AAC AAC CCG GGG AAC CCA TCC 
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Gln Asn Ala Ala Ser Arg Arg Thr Pro Asn Asn Pro Gly Asn Pro Ser 

1125 H3o 1135 

A^n ^° £ CC CCC ^ ACC CCC GAG AAT AGC CTT ATC GTC ACC AAC 

Asn Pro Gly Pro Pro Lys Thr Pro Glu Asn Ser Leu i£ vll J£ j£n 
1145 1150 

Prf »S C ^ ACC *** TCA GCT AAG ACT GCC AGG AAA CCC GAC 

Pro Ser Gly Thr Gin Thr Asn Ser Ala Lys Thr Ala Arg Lys Pro Sp 
1160 • xi65 

SS E E 2 S S 2 2 S SS S 2 2 2 £S SS 

1175 1180 

i£ Sal Si Gin S3 JJS A ?° ^ CCA GAC CCA CTG CCA AAA 

1185 ii qn y ASn Pr ° Pro Leu Pro L V S 

1195 3.200 

Si SS SS SS « S! SK SS K £ J£ SI S? SS 2 

1210 1215 
GGC CCT AAG CCA ATG CCT CCC TAT AGC TCC ATG TTC ATC CTG TCC arr 
Gly Pro Lys Pro Met Pro Pro Tyr Ser Ser Met 2£ He ler ?S 

1225 12 3o 

S SS £2 SS 2 2 2 SS S 3 £2 SS SS 
2 SS S SS 2 SS 22 S ES2 2 22 E 2 

1^ = = 1260 
GCC GCC GAG GAC CCT GTG CAG CCC AAC GCA CCT CGG AAC AAC GTG CTr 
Alalia Glu Asp Pro Val Gin Pro Asn Ala Pro Arg £n ™ vS Su 
U 1275 1280 

££ SI GAC I AC S™ TTT ACA GGC GTC TTC ACC TTT GAG ATG GTG 

Arg Tyr Phe Asp Tyr Val Phe Thr Gly Val Phe Thr Phe Glu Met Val 

1285 1290 3.295 

ATC AAG ATG ATT GAC CTG GGG CTC GTC CTG CAT CAG GGT GCC TAC TTC 
He Lys Met lie Asp Leu Gly Leu Val Leu His Sn c£y aS pie 
1300 1305 133.0 

CG I GAC T CTC TGG AAT ATT CTC GAC TTC ATA GTG GTC ACT GGG GCC CTG 
Arg Asp Leu Tip Asn lie Leu Asp Phe He Val Val Ser Gly Ala LeS 
XJ - L = 1320 1325 

Phi oI C ACT GGC ^ AGC *** GGA AAA GAC ATC AAC ACG 

JXo Thr ASn Ser Lys G1 y Lys Asp He Asn Thr 

u 1335 3.3 40 

ATT AAA TCC CTC CGA GTC CTC CGG GTG CTA CGA CCT CTT AAA ACC ATC 
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AAG CGG CTG CCA AAG CTC AAG GCT GTG TTT GAC TGT GTG GTG AAC TCA 
Lys Arg Leu Pro Lys Leu Lys Ala Val Phe Asp Cys Val Val Asn Ser 
1365 1370 1375 

CTT AAA AAC GTC TTC AAC ATC CTC ATC GTC TAC ATG CTA TTC ATG TTC 
Leu Lys Asn Val Phe Asn He Leu He Val Tyr Met Leu Phe Met Phe 
1380 1385 1390 

ATC TTC GCC GTG GTG GCT GTG CAG CTC TTC AAG GGG AAA TTC TTC CAC 
He Phe Ala Val Val Ala Val Gin Leu Phe Lys Gly Lys Phe Phe His 
1395 1400 1405 

TGC ACT GAC GAG TCC AAA GAG TTT GAG AAA GAT TGT CGA GGC AAA TAC 
Cys Thr Asp Glu Ser Lys Glu Phe Glu Lys Asp Cys Arg Gly Lys Tyr 
1410 1415 1420 

CTC CTC TAC GAG AAG AAT GAG GTG AAG GCG CGA GAC CGG GAG TGG AAG 
Leu Leu Tyr Glu Lys Asn Glu Val Lys Ala Arg Asp Arg Glu Trp Lys 
1425 1430 1435 1440 

AAG TAT GAA TTC CAT TAC GAC AAT GTG CTG TGG GCT CTG CTG ACC CTC 
Lys Tyr Glu Phe His Tyr Asp Asn Val Leu Txp Ala Leu Leu Thr Leu 
1445 1450 1455 

TTC ACC GTG TCC ACG GGA GAA GGC TGG CCA CAG GTC CTC AAG CAT TCG 
Phe Thr Val Ser Thr Gly Glu Gly Trp Pro Gin Val Leu Lys His Ser 
1460 1465 1470 

GTG GAC GCC ACC TTT GAG AAC CAG GGC CCC AGC CCC GGG TAC CGC ATG 
Val Asp Ala Thr Phe Glu Asn Gin Gly Pro Ser Pro Gly Tyr Arg Met 
1475 1480 1485 

GAG ATG TCC ATT TTC TAC GTC GTC TAC TTT GTG GTG TTC CCC TTC TTC 
Glu Met Ser He Phe Tyr Val Val Tyr Phe Val Val Phe Pro Phe Phe 
1490 1495 1500 

TTT GTC AAT ATC TTT GTG GCC TTG ATC ATC ATC ACC TTC CAG GAG CAA 
Phe Val Asn He Phe Val Ala Leu He He He Thr Phe Gin Glu Gin 
1505 1510 1515 1520 

GGG GAC AAG ATG ATG GAG GAA TAC AGC CTG GAG AAA AAT GAG AGG GCC 
Gly Asp Lys Met Met Glu Glu Tyr Ser Leu Glu Lys Asn Glu Arg Ala 
1525 1530 1535 

TGC ATT GAT TTC GCC ATC AGC GCC AAG CCG CTG ACC CGA CAC ATG CCG 
Cys He Asp Phe Ala He Ser Ala Lys Pro Leu Thr Arg His Met Pro 
1540 1545 1550 

CAG AAC AAG CAG AGC TTC CAG TAC CGC ATG TGG CAG TTC GTG GTG TCT 
Gin Asn Lys Gin Ser Phe Gin Tyr Arg Met Trp Gin Phe Val Val Ser 
1555 1560 1565 

CCG CCT TTC GAG TAC ACG ATC ATG GCC ATG ATC GCC CTC AAC ACC ATC 
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Pr ° ^c?n Phe Glu Tyr Thr Ile Met Ala Met Ile Ala Leu Asn Thr Ile 
1570 1575 1580 

GTG CTT ATG ATG AAG TTC TAT GGG GCT TCT GTT GCT TAT GAA AAT GCC 
Val Leu Met Met Lys Phe Tyr Gly Ala Ser Val Ala Tyr Glu Asn Ala 
1585 1590 1595 1600 

CTG CGG GTG TTC AAC ATC GTC TTC ACC TCC CTC TTC TCT CTG GAA TGT 
Leu Arg Val Phe Asn lie Val Phe Thr Ser Leu Phe Ser Leu Glu Cys 
1605 1610 16 15 

GTG CTG AAA GTC ATG GCT TTT GGG ATT CTG AAT TAT TTC CGC GAT GCC 
Val Leu Lys Val Met Ala Phe Gly Ile Leu Asn Tyr Phe Arg Asp Ala 
1620 1625 1630 

^ C r? C GAC GTG ACT GTT CTG GGC AGC ATC ACC GAT ATC 

Trp Asn lie Phe Asp Phe Val Thr Val Leu Gly Ser Ile Thr Asp Ile 
I 635 1640 1645 

SS vJ? t£ T o AG «3 T GGG CCG *** ^ 770 ATC CTG AGC TTT 

Leu Val Thr Glu Phe Gly Asn Pro Asn Asn Phe He Asn Leu Ser Phe 
1650 1655 1660 

CTC CGC CTC TTC CGA GCT GCC CGG CTC ATC AAA CTT CTC CGT CAG GGT 
Leu Arg Leu Phe Arg Ala Ala Arg Leu lie Lys Leu Leu Arg Gin Gly 
1665 1670 1675 1680 

TAC ACC ATC CGC ATT CTT CTC TGG ACC TTT GTG CAG TCC TTC AAG GCC 
Tyr Thr lie Arg He Leu Leu Tip Thr Phe Val Gin Ser Phe Lys Ala 
1685 1690 1695 

CTG CCT TAT GTC TGT CTG CTG ATC GCC ATG CTC TTC TTC ATC TAT GCC 
Leu Pro Tyr Val Cys Leu Leu lie Ala Met Leu Phe Phe Ile Tyr Ala 
1700 1705 1710 

ATC ATT GGG ATG CAG GTG TTT GGT AAC ATT GGC ATC GAC GTG GAG GAC 
Ile Ile Gly Met Gin Val Phe Gly Asn lie Gly Ile Asp Val Glu Asp 
1715 1720 1725 

GAG GAC AGT GAT GAA GAT GAG TTC CAA ATC ACT GAG CAC AAT AAC TTC 
Glu Asp Ser Asp Glu Asp Glu Phe Gin Ile Thr Glu His Asn Asn Phe 
1730 1735 1740 

CGG ACC TTC TTC CAG GCC CTC ATG CTT CTC TTC CGG AGT GCC ACC GGG 
Arg Thr Phe Phe Gin Ala Leu Met Leu Leu Phe Arg Ser Ala Thr Gly 
l" 745 1750 1755 1760 

GAA GCT TGG CAC AAC ATC ATG CTT TCC TGC CTC AGC GGG AAA CCG TGT 
Glu Ala Trp His Asn Ile Met Leu Ser Cys Leu Ser Gly Lys Pro Cys 
1765 1770 1775 

GAT AAG AAC TCT GGC ATC CTG ACT CGA GAG TGT GGC AAT GAA TTT GCT 
Asp Lys Asn Ser Gly Ile Leu Thr Arg Glu Cys Gly Asn Glu Phe Ala 
1780 1785 1790 

TAT TTT TAC TTT GTT TCC TTC ATC TTC CTC TGC TCG TTT CTG ATG CTG 
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Tyr 


Phe 


Tyr Phe 
1795 


Val 


Ser 


Phe 


He Phe 
1800 


Leu 


Cys 


Ser 


Phe Leu 
1805 


Met 


Leu 


AAT 
Asn 


CTC TTT 
Leu Phe 
1810 


GTC 
Val 


GCC 
Ala 


GTC 
Val 


ATC ATG 
He Met 
1815 


GAC 
Asp 


AAC 
Asn 


TTT 
Phe 


GAG TAC 
Glu Tyr 
1820 


CTC 
Leu 


ACC 
Thr 


CGA 


GAC TCC 
Asp Ser 
1825 


TCC 
Ser 


ATC 
He 


CTG 
Leu 


GGC CCC 
Gly Pro 
1830 


CAC 
His 


CAC 
His 


CTG 
Leu 


GAT GAG 
Asp GlU 
1835 


TAC 
Tyr 


GTG 
Val 


CGT GTC 
Arg Val 
1840 


TGG 
Trp 


GCC 
Ala 


GAG 
Glu 


TAT GAC CCC 
Tyr Asp Pro 
1845 


GCA 
Ala 


GCT 
Ala 


TGG GGC CGC 
Trp Gly Arg 
1850 


ATG 
Met 


CCT 
Pro 


TAC 
Tyr 


CTG GAC 
Leu Asp 
1855 


ATG 
Met 


TAT 
Tyr 


CAG 
Gin 


ATG CTG 
Met Leu 
1860 


AGA 
Arg 


CAC 

His 


ATG 
Met 


TCT CCG 
Ser Pro 
1865 


CCC 
Pro 


CTG 
Leu 


GGT 
Gly 


CTG GGG 
Leu Gly 
1870 


AAG 
Lys 


AAG 
Lys 


TGT 

Cys 


CCG GCC 
Pro Ala 
1875 


AGA 
Arg 


GTG 
Val 


GCT 
Ala 


TAC AAG 
Tyr Lys 
1880 


CGG 
Arg 


CTT 
Leu 


CTG 
Leu 


CGG ATG 
Arg Met 
1885 


GAC 
Asp 


CTG 
Leu 


ccc 
Pro 


GTC GCA 
Val Ala 
1890 


GAT 
Asp 


GAC 
Asp 


AAC 
Asn 


ACC GTC 
Thr Val 
1895 


CAC 
His 


TTC 
Phe 


AAT 
Asn 


TCC ACC 
Ser Thr 
1900 


CTC 
Leu 


ATG 
Met 


GCT 
Ala 


CTG ATC 
Leu He 
1905 


CGC 
Arg 


ACA 
Thr 


GCC 
Ala 


CTG GAC 
Leu Asp 
1910 


ATC 
He 


AAG 
Lys 


ATT 

He 


GCC AAG 
Ala Lys 
1915 


GGA 
Gly 


GGA 
Gly 


GCC 
Ala 


GAC 
Asp 
1920 


AAA 

Lys 


CAG 
Gin 


CAG 
Gin 


ATG 

Met 


GAC GCT 
Asp Ala 
1925 


GAG 
Glu 


CTG 
Leu 


CGG 
Arg 


AAG GAG 
Lys Glu 
1930 


ATG 
Met 


ATG 
Met 


GCG 
Ala 


ATT TGG 
He Trp 
1935 


CCC 
Pro 


AAT 
Asn 


CTG 
Leu 


TCC CAG 
Ser Gin 
1940 


AAG 
Lys 


ACG 
Thr 


CTA 
Leu 


GAC CTG 
Asp Leu 
1945 


CTG 
Leu 


GTC 
Val 


ACA 
Thr 


CCT CAC 
Pro His 
1950 


AAG 
Lys 


TCC 
Ser 


ACG 
Thr 


GAC CTC 
Asp Leu 
1955 


ACC 
Thr 


GTG 
Val 


GGG 
Gly 


AAG ATC 
Lys He 
1960 


TAC 
Tyr 


GCA 
Ala 


GCC 
Ala 


ATG ATG 
Met Met 
1965 


ATC 
He 


ATG 
Met 


GAG 
Glu 


TAC TAC CGG 
Tyr Tyr Arg 
1970 


CAG 
Gin 


AGC 
Ser 


AAG GCC 
Lys Ala 
1975 


AAG 
Lys 


AAG 
Lys 


CTG 
Leu 


CAG GCC 
Gin Ala 
1980 


ATG 
Met 


CGC 
Arg 


GAG 
Glu 


GAG CAG 
Glu Gin 
1985 


GAC CGG 
Asp Arg 


ACA 
Thr 


CCC CTC 
Pro Leu 
1990 


ATG 
Met 


Phe 


CAG 
Gin 


CGC ATG 
Arg Met 
1995 


GAG 
Glu 


CCC 
Pro 


CCG 
Pro 


TCC 
Ser 
2000 


CCA 
Pro 


ACG 
Thr 


CAG 
Gin 


GAA 
Glu 


GGG GGA CCT GGC 
Gly Gly Pro Gly 
2005 


CAG 
Gin 


AAC GCC 
Asn Ala 
2010 


CTC 
Leu 


CCC 
Pro 


TCC 
Ser 


ACC CAG 
Thr Gin 
2015 


CTG 


GAC 


CCA 


GGA 


GGA 


GCC 


CTG 


ATG 


GCT 


CAC 


GAA AGC 


GGC 


CTC 


AAG 


GAG 
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Leu Asp Pro Gly Gly Ala Leu Met Ala His Glu Ser Gly Leu Lys Glu 
2020 2025 2030 

AGC CCG TCC TGG GTG ACC CAG CGT GCC CAG GAG ATG TTC CAG AAG ACG 
Ser Pro Ser Trp Val Thr Gin Arg Ala Gin Glu Met Phe Gin Lys Thr 
2035 2040 2045 

GGC ACA TGG AGT CCG GAA CAA GGC CCC CCT ACC GAC ATG CCC AAC AGC 
Gly Thr Trp Ser Pro Glu Gin Gly Pro Pro Thr Asp Met Pro Asn Ser 
2050 2055 2060 

CAG CCT AAC TCT CAG TCC GTG GAG ATG CGA GAG ATG GGC AGA GAT GGC 
?i!L Pr ° Asn Ser Gln Ser Val Glu Met ^9 Glu Met Gly Arg Asp Gly 
2065 2070 2075 2080 

TAC TCC GAC AGC GAG CAC TAC CTC CCC ATG GAA GGC CAG GGC CGG GCT 
Tyr Ser Asp Ser Glu His Tyr Leu Pro Met Glu Gly Gin Gly Arg Ala 
2085 2090 " 2095 

GCC TCC ATG CCC CGC CTC CCT GCA GAG AAC CAG AGG AGA AGG GGC CGG 
Ala Ser Met Pro Arg Leu Pro Ala Glu Asn Gin Arg Arg Arg Gly Arq 
2100 2105 2110 

CCA CGT GGG AAT AAC CTC AGT ACC ATC TCA GAC ACC AGC CCC ATG AAG 
Pro Arg Gly Asn Asn Leu Ser Thr He Ser Asp Thr Ser Pro Met Lys 
2115 2120 2125 

CGT TCA GCC TCC GTG CTG GGC CCC AAG GCC CGA CGC CTG GAC GAT TAC 
Arg Ser Ala Ser Val Leu Gly Pro Lys Ala Arg Arg Leu Asp Asp Tyr 
2130 2135 2140 

TCG CTG GAG CGG GTC CCG CCC GAG GAG AAC CAG CGG CAC CAC CAG CGG 
Ser Leu Glu Arg Val Pro Pro Glu Glu Asn Gin Arg His His Gin Arg 
2145 2150 2155 2160 

CGC CGC GAC CGC AGC CAC CGC GCC TCT GAG CGC TCC CTG GGC CGC TAC 
Arg Arg Asp Arg Ser His Arg Ala Ser Glu Arg Ser Leu Gly Arg Tyr 
2165 2170 2175 

ACC GAT GTG GAC ACA GGC TTG GGG ACA GAC CTG AGC ATG ACC ACC CAA 
Thr Asp Val Asp Thr Gly Leu Gly Thr Asp Leu Ser Met Thr Thr Gin 
2180 2185 2190 

TCC GGG GAC CTG CCG TCG AAG GAG CGG GAC CAG GAG CGG GGC CGG CCC 
Ser Gly Asp Leu Pro Ser Lys Glu Arg Asp Gin Glu Arg Gly Arg Pro 
2195 2200 2205 

AAG GAT CGG AAG CAT CGA CAG CAC CAC CAC CAC CAC CAC CAC CAC CAC 
Lys Asp Arg Lys His Arg Gin His His His His His His His His His 
2210 2215 2220 

CAT CCC CCG CCC CCC GAC AAG GAC CGC TAT GCC CAG GAA CGG CCG GAC 
Hxs Pro Pro Pro Pro Asp Lys Asp Arg Tyr Ala Gin Glu Arg Pro Asp 
2225 2230 2235 2240 

CAC GGC CGG GCA CGG GCT CGG GAC CAG CGC TGG TCC CGC TCG CCC AGC 
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His Gly Arg Ala Arg Ala Arg Asp Gin Arg Trp Ser Arg Ser Pro Ser 
2245 2250 2255 

GAG GGC CGA GAG CAC ATG GCG CAC CGG CAG GGC AGT AGT TCC GTA AGT 7052 
Glu Gly Arg Glu His Met Ala His Arg Gin Gly Ser Ser Ser Val Ser 
2260 2265 2270 

GGA AGC CCA GCC CCC TCA ACA TCT GGT ACC AGC ACT CCG CGG CGG GGC 7100 
Gly Ser Pro Ala Pro Ser Thr Ser Gly Thr Ser Thr Pro Arg Arg Gly 
2275 2280 2285 

CGC CGC CAG CTC CCC CAG ACC CCC TCC ACC CCC CGG CCA CAC GTG TCC 714 8 

Arg Arg Gin Leu Pro Gin Thr Pro Ser Thr Pro Arg Pro His Val Ser 
2290 2295 2300 

TAT TCC CCT GTG ATC CGT AAG GCC GGC GGC TCG GGG CCC CCG CAG CAG 7196 
Tyr Ser Pro Val He Arg Lys Ala Gly Gly Ser Gly Pro Pro Gin Gin 
2305 2310 2315 2320 

CAG CAG CAG CAG CAG CAG CAG CAG CAG GCG GTG GCC AGG CCG GGC CGG 7244 
Gin Gin Gin Gin Gin Gin Gin Gin Gin Ala Val Ala Arg Pro Gly Arg 
2325 2330 2335 

GCG GCC ACC AGC GGC CCT CGG AGG TAC CCA GGC CCC ACG GCC GAG CCT 7292 
Ala Ala Thr Ser Gly Pro Arg Arg Tyr Pro Gly Pro Thr Ala Glu Pro 
2340 2345 2350 

CTG GCC GGA GAT CGG CCG CCC ACG GGG GGC CAC AGC AGC GGC CGC TCG 734 0 

Leu Ala Gly Asp Arg Pro Pro Thr Gly Gly His Ser Ser Gly Arg Ser 
2355 2360 2365 

CCC AGG ATG GAG AGG CGG GTC CCA GGC CCG GCC CGG AGC GAG TCC CCC 73 8 8 

Pro Arg Met Glu Arg Arg Val Pro Gly Pro Ala Arg Ser Glu Ser Pro 
2370 2375 2380 

AGG GCC TGT CGA CAC GGC GGG GCC CGG TGG CCG GCA TCT GGC CCG CAC 7436 
Arg Ala Cys Arg His Gly Gly Ala Arg Trp Pro Ala Ser Gly Pro His 
2385 2390 2395 2400 

GTG TCC GAG GGG CCC CCG GGT CCC CGG CAC CAT GGC TAC TAC CGG GGC 7484 
Val Ser Glu Gly Pro Pro Gly Pro Arg His His Gly Tyr Tyr Arg Gly 
2405 2410 2415 

TCC GAC TAC GAC GAG GCC GAT GGC CCG GGC AGC GGG GGC GGC GAG GAG 7 532 

Ser Asp Tyr Asp Glu Ala Asp Gly Pro Gly Ser Gly Gly Gly Glu Glu 
2420 2425 2430 

GCC ATG GCC GGG GCC TAC GAC GCG CCA CCC CCC GTA CGA CAC GCG TCC 7 58 0 

Ala Met Ala Gly Ala Tyr Asp Ala Pro Pro Pro Val Arg His Ala Ser 
2435 2440 2445 

TCG GGC GCC ACC GGG CGC TCG CCC AGG ACT CCC CGG GCC TCG GGC CCG 7628 
Ser Gly Ala Thr Gly Arg Ser Pro Arg Thr Pro Arg Ala Ser Gly Pro 
2450 2455 2460 

GCC TGC GCC TCG CCT TCT CGG CAC GGC CGG CGA CTC CCC AAC GGC TAC 7676 
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Ala Cys Ala Ser Pro Ser Arg His Gly Arg Arg Leu Pro Asn Gly Tyr 
2465 2470 2475 2480 

TAC CCG GCG CAC GGA CTG GCC AGG CCC CGC GGG CCG GGC TCC AGG AAG 7724 
Tyr Pro Ala His Gly Leu Ala Arg Pro Arg Gly Pro Gly Ser Arg Lys 
2485 2490 2495 

GGC CTG CAC GAA CCC TAC AGC GAG AGT GAC GAT GAT TGG TGC TAAGCCCGGG 7776 
Gly Leu His Glu Pro Tyr Ser Glu Ser Asp Asp Asp Trp Cys 
2500 2505 2510 

CGAGGTGGCG CCCGCCCGGC CCCCCACGCA CC 7806 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7791 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 237.. 7037 

(D) OTHER INFORMATION: /standard_name= "Alpha-lA-2" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

GATGTCCCGA GCTGCTATCC CCGGCTCGGC CCGGGCAGCC GCCTTCTGAG CCCCCGACCC 60 

GAGGCGCCGA GCCGCCGCCG CCCGATGGGC TGGGCCGTGG AGCGTCTCCG CAGTCGTAGC 120 

TCCAGCCGCC GCGCTCCCAG CCCCGGCAGC CTCAGCATCA GCGGCGGCGG CGGCGGCGGC 180 

GGCGTCTTCC GCATCGTTCG CCGCAGCGTA ACCCGGAGCC CTTTGCTCTT TGCAGA 236 

ATG GCC CGC TTC GGA GAC GAG ATG CCG GCC CGC TAC GGG GGA GGA GGC 284 
Met Ala Arg Phe Gly Asp Glu Met Pro Ala Arg Tyr Gly Gly Gly Gly 
15 10 15 

TCC GGG GCA GCC GCC GGG GTG GTC GTG GGC AGC GGA GGC GGG CGA GGA 332 
Ser Gly Ala Ala Ala Gly Val Val Val Gly Ser Gly Gly Gly Arg Gly 
20 25 30 

GCC GGG GGC AGC CGG CAG GGC GGG CAG CCC GGG GCG CAA AGG ATG TAC 380 
Ala Gly Gly Ser Arg Gin Gly Gly Gin Pro Gly Ala Gin Arg Met Tyr 
35 40 45 

AAG CAG TCA ATG GCG CAG AGA GCG CGG ACC ATG GCA CTC TAC AAC CCC 428 
Lys Gin Ser Met Ala Gin Arg Ala Arg Thr Met Ala Leu Tyr Asn Pro 
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ATC CCC GTC CGA CAG AAC TGC CTC ACG GTT AAC CGG TCT CTC TTC CTC 
lie Pro Val Arg Gin Asn Cys Leu Thr Val Asn Arg Ser Leu Phe Leu 
65 70 75 80 

TTC AGC GAA GAC AAC GTG GTG AGA AAA TAC GCC AAA AAG ATC ACC GAA 
Phe Ser Glu Asp Asn Val Val Arg Lys Tyr Ala Lys Lys He Thr Glu 
85 90 95 

TGG CCT CCC TTT GAA TAT ATG ATT TTA GCC ACC ATC ATA GCG AAT TGC 
Trp Pro Pro Phe Glu Tyr Met He Leu Ala Thr He He Ala Asn Cys 
100 105 110 

ATC GTC CTC GCA CTG GAG CAG CAT CTG CCT GAT GAT GAC AAG ACC CCG 
He Val Leu Ala Leu Glu Gin His Leu Pro Asp Asp Asp Lys Thr Pro 
115 120 125 

ATG TCT GAA CGG CTG GAT GAC ACA GAA CCA TAC TTC ATT GGA ATT TTT 
Met Ser Glu Arg Leu Asp Asp Thr Glu Pro Tyr Phe He Gly He Phe 
130 135 140 

TGT TTC GAG GCT GGA ATT AAA ATC ATT GCC CTT GGG TTT GCC TTC CAC 
Cys Phe Glu Ala Gly He Lys He He Ala Leu Gly Phe Ala Phe His 
145 150 155 160 

AAA GGC TCC TAC TTG AGG AAT GGC TGG AAT GTC ATG GAC TTT GTG GTG 
Lys Gly Ser Tyr Leu Arg Asn Gly Trp Asn Val Met Asp Phe Val Val 
165 170 175 

GTG CTA ACG GGC ATC TTG GCG ACA GTT GGG ACG GAG TTT GAC CTA CGG 
Val Leu Thr Gly He Leu Ala Thr val Gly Thr Glu Phe Asp Leu Arg 
180 185 190 

ACG CTG AGG GCA GTT CGA GTG CTG CGG CCG CTC AAG CTG GTG TCT GGA 
Thr Leu Arg Ala Val Arg Val Leu Arg Pro Leu Lys Leu Val Ser Gly 
195 200 205 

ATC CCA AGT TTA CAA GTC GTC CTG AAG TCG ATC ATG AAG GCG ATG ATC 
He Pro Ser Leu Gin Val Val Leu Lys Ser He Met Lys Ala Met He 
210 215 220 

CCT TTG CTG CAG ATC GGC CTC CTC CTA TTT TTT GCA ATC CTT ATT TTT 
Pro Leu Leu Gin He Gly Leu Leu Leu Phe Phe Ala He Leu He Phe 
225 230 235 240 

GCA ATC ATA GGG TTA GAA TTT TAT ATG GGA AAA TTT CAT ACC ACC TGC 
Ala He He Gly Leu Glu Phe Tyr Met Gly Lys Phe His Thr Thr Cys 
245 250 255 

TTT GAA GAG GGG ACA GAT GAC ATT CAG GGT GAG TCT CCG GCT CCA TGT 
Phe Glu Glu Gly Thr Asp Asp He Gin Gly Glu Ser Pro Ala Pro Cys 
260 265 270 

GGG ACA GAA GAG CCC GCC CGC ACC TGC CCC AAT GGG ACC AAA TGT CAG 
Gly Thr Glu Glu Pro Ala Arg Thr Cys Pro Asn Gly Thr Lys Cys Gin 
275 280 285 
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CCC TAC TGG GAA GGG CCC AAC AAC GGG ATC ACT CAG TTC GAC AAC ATC 
Pro Tyr Trp Glu Gly Pro Asn A S n Gly He Thr Sp j££ He 

295 300 

SI v T ? r CTG JP GT T 7X0 TCC ATA AC = ATG GAA GGG TGG 

Leu Phe Ala Val Leu Thr Val Phe Gin Cys He Thr Met Glu Gly Tip 
310 315 320 

Thr 2£ t CTC r CTC I AC AGC AAC GAT GCC TCA GGG AAC ACT TGG AAC 

Thr Asp Leu Leu Tyr Asn Ser Asn Asp Ala Ser Gly Asn Thr Trp Asn 
325 330 335 

?SS 7I G I AC ^ C A T C CCC CTC ATC ATC ATC SGC TCC TTT TTT ATG CTG 
Trp Leu Tyr Phe lie Pro Leu lie He He Gly Ser Phe III Se? 22 
340 345 35 0 

2£ S ?S S SJ S £2 S S» SS S S 2£ Si £ S5 

" s 360 365 

CGG G J G GAG AAC CGG CGG GCT TTT CTG AAG CTG AGG CGG CAA CAA CAG 
Arg val Glu Asn Arg Arg Ala Phe Leu Lys Leu Arg Arg Gin Sn~ Gin 
375 380 

nl a™ T CTC ** T GGG TAC ATG GAA TGG A TC TCA AAA GCA GAA 

He Glu Arg Glu Leu Asn Gly Tyr Met Glu Trp He Ser Lys Ala Glu 
390 395 400 

ciu S3 nf Aif GAT ^ ACT GAC GGG GAG CAG AGG <** c ce 

1U Val Ile Leu Ala Glu Asp Glu Thr Asp Gly Glu Gin Arg His Pro 
405 410 4 i 5 

pSe ££ r? A GCT T CTG CGG AGA ACC ACC ATA AAA AGC AAG ACA GAT 

Phe Asp Gly Ala Leu Arg Arg Thr Thr Ile Lys Lys Ser Lys Thr Asp 
420 425 430 

TTG CTC AAC CCC GAA GAG GCT GAG GAT CAG CTG GCT GAT ATA GCC TCT 
Leu Leu Asn Pro Glu Glu Ala Glu Asp Gin Leu Sa As? H* S£ 
435 440 445 

vlf S3 Ser" Pro ^ G ?° CGA GCC AGC ATT *** AGT GCC CTG GAG 

G =K ° Phe Ala ^ S Ala Ser Ile L y s Ser Ala Lys Leu Glu 

* so 455 460 

AAC TCG ACC TTT TTT CAC AAA AAG GAG AGG AGG ATG CGT TTC TAC ATC 
Asn Ser Thr Phe Phe His Lys Lys Glu Arg Arg Met Arg Phe Tyr n° 
465 470 475 4 8 o 

CGC CGC ATG GTC AAA ACT CAG GCC TTC TAC TGG ACT GTA CTC AGT TTG 
Arg Arg Met Val Lys Thr Gin Ala Phe Tyr Trp Thr Val Leu Ser Leu 
485 490 495 

GTA GCT CTC AAC ACG CTG TGT GTT GCT ATT GTT CAC TAC AAC CAG CCC 
Val Ala Leu Asn Thr Leu Cys Val Ala Ile Val His Tyr Asn Gin Pro 
500 505 510 
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GAG TGG CTC TCC GAC TTC CTT TAC TAT GCA GAA TTC ATT TTC TTA GGA 
Glu Trp Leu Ser Asp Phe Leu Tyr Tyr Ala Glu Phe lie Phe Leu Gly 
515 520 525 

CTC TTT ATG TCC GAA ATG TTT ATA AAA ATG TAC GGG CTT GGG ACG CGG 
Leu Phe Met Ser Glu Met Phe He Lys Met Tyr Gly Leu Gly Thr Arg 
530 535 540 

CCT TAC TTC CAC TCT TCC TTC AAC TGC TTT GAC TGT GGG GTT ATC ATT 
Pro Tyr Phe His Ser Ser Phe Asn Cys Phe Asp Cys Gly Val He He 
545 550 555 560 

GGG AGC ATC TTC GAG GTC ATC TGG GCT GTC ATA AAA CCT GGC ACA TCC 
Gly Ser He Phe Glu Val He Trp Ala Val He Lys Pro Gly Thr Ser 
565 570 575 

TTT GGA ATC AGC GTG TTA CGA GCC CTC AGG TTA TTG CGT ATT TTC AAA 
Phe Gly He Ser Val Leu Arg Ala Leu Arg Leu Leu Arg He Phe Lys 
580 585 590 

GTC ACA AAG TAC TGG GCA TCT CTC AGA AAC CTG GTC GTC TCT CTC CTC 
Val Thr Lys Tyr Trp Ala Ser Leu Arg Asn Leu Val Val Ser Leu Leu 
595 600 605 

AAC TCC ATG AAG TCC ATC ATC AGC CTG TTG TTT CTC CTT TTC CTG TTC 
Asn Ser Met Lys Ser He He Ser Leu Leu Phe Leu Leu Phe Leu Phe 
610 615 620 

ATT GTC GTC TTC GCC CTT TTG GGA ATG CAA CTC TTC GGC GGC CAG TTT 
He Val Val Phe Ala Leu Leu Gly Met Gin Leu Phe Gly Gly Gin Phe 
625 630 635 640 

AAT TTC GAT GAA GGG ACT CCT CCC ACC AAC TTC GAT ACT TTT CCA GCA 
Asn Phe Asp Glu Gly Thr Pro Pro Thr Asn Phe Asp Thr Phe Pro Ala 
645 650 655 

GCA ATA ATG ACG GTG TTT CAG ATC CTG ACG GGC GAA GAC TGG AAC GAG 
Ala He Met Thr Val Phe Gin He Leu Thr Gly Glu Asp Trp Asn Glu 
660 665 670 

GTC ATG TAC GAC GGG ATC AAG TCT CAG GGG GGC GTG CAG GGC GGC ATG 
Val Met Tyr Asp Gly He Lys Ser Gin Gly Gly val Gin Gly Gly Met 
675 680 685 

GTG TTC TCC ATC TAT TTC ATT GTA CTG ACG CTC TTT GGG AAC TAC ACC 
Val Phe Ser He Tyr Phe He Val Leu Thr Leu Phe Gly Asn Tyr Thr 
690 695 700 

CTC CTG AAT GTG TTC TTG GCC ATC GCT GTG GAC AAT CTG GCC AAC GCC 
Leu Leu Asn Val Phe Leu Ala He Ala Val Asp Asn Leu Ala Asn Ala 
705 710 715 720 

CAG GAG CTC ACC AAG GTG GAG GCG GAC GAG CAA GAG GAA GAA GAA GCA 
Gin Glu Leu Thr Lys Val Glu Ala Asp Glu Gin Glu Glu Glu Glu Ala 
725 730 735 
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GCG AAC CAG AAA CTT GCC CTA CAG AAA GCC AAG GAG GTG GCA GAA GTG 
Ala Asn Gin Lys Leu Ala Leu Gin Lys Ala Lys Glu Val Ala Glu Val 
740 745 750 

AGT CCT CTG TCC GCG GCC AAC ATG TCT ATA GCT GTG AAA GAG CAA CAG 
Ser Pro Leu Ser Ala Ala Asn Met Ser He Ala Val Lys Glu Gin Gin 
755 760 765 

AAG AAT CAA AAG CCA GCC AAG TCC GTG TGG GAG CAG CGG ACC AGT GAG 
Lys Asn Gin Lys Pro Ala Lys Ser Val Trp Glu Gin Arg Thr Ser Glu 
""0 775 780 

ATG CGA AAG CAG AAC TTG CTG GCC AGC CGG GAG GCC CTG TAT AAC GAA 
Met Arg Lys Gin Asn Leu Leu Ala Ser Arg Glu Ala Leu Tyr Asn Glu 
785 790 795 ' BOO 

ATG GAC CCG GAC GAG CGC TGG AAG GCT GCC TAC ACG CGG CAC CTG CGG 
Met Asp Pro Asp Glu Arg Trp Lys Ala Ala Tyr Thr Arg His Leu Arg 
805 810 815 

CCA GAC ATG AAG ACG CAC TTG GAC CGG CCG CTG GTG GTG GAC CCG CAG 
Pro Asp Met Lys Thr His Leu Asp Arg Pro Leu Val Val Asp Pro Gin 
820 825 830 

GAG AAC CGC AAC AAC AAC ACC AAC AAG AGC CGG GCG GCC GAG CCC ACC 
Glu Asn Arg Asn Asn Asn Thr Asn Lys Ser Arg Ala Ala Qlu Pro Thr 
835 840 845 

GTG GAC CAG CGC CTC GGC CAG CAG CGC GCC GAG GAC TTC CTC AGG AAA 
Val Asp Gin Arg Leu Gly Gin Gin Arg Ala Glu Asp Phe Leu Art? Lvs 
850 855 860 

CAG GCC CGC TAC CAC GAT CGG GCC CGG GAC CCC AGC GGC TCG GCG GGC 
Gin Ala Arg Tyr His Asp Arg Ala Arg Asp Pro Ser Gly Ser Ala Glv 
865 870 875 880 

CTG GAC GCA CGG AGG CCC TGG GCG GGA AGC CAG GAG GCC GAG CTG AGC 
Leu Asp Ala Arg Arg Pro Trp Ala Gly Ser Gin Glu Ala Glu Leu Ser 
885 890 895 

CGG GAG GGA CCC TAC GGC CGC GAG TCG GAC CAC CAC GCC CGG GAG GGC 
Arg Glu Gly Pro Tyr Gly Arg Glu Ser Asp His His Ala Arg Glu Gly 
900 905 910 

AGC CTG GAG CAA CCC GGG TTC TGG GAG GGC GAG GCC GAG CGA GGC AAG 
Ser Leu Glu Gin Pro Gly Phe Trp Glu Gly Glu Ala Glu Arg Gly Lys 
915 920 925 

GCC GGG GAC CCC CAC CGG AGG CAC GTG CAC CGG CAG GGG GGC AGC AGG 
Ala Gly Asp Pro His Arg Arg His Val His Arg Gin Gly Gly Ser Arq 
930 935 940 

GAG AGC CGC AGC GGG TCC CCG CGC ACG GGC GCG GAC GGG GAG CAT CGA 
Glu Ser Arg Ser Gly Ser Pro Arg Thr Gly Ala Asp Gly Glu His Arg 
945 950 955 960 
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CGT CAT CGC GCG CAC CGC AGG CCC GGG GAG GAG GGT CCG GAG GAC AAG 3164 
Arg His Arg Ala His Arg Arg Pro Gly Glu Glu Gly Pro Glu Asp Lys 
965 970 975 

GCG GAG CGG AGG GCG CGG CAC CGC GAG GGC AGC CGG CCG GCC CGG GGC 3212 
Ala Glu Arg Arg Ala Arg His Arg Glu Gly Ser Arg Pro Ala Arg Gly 
980 985 990 

GGC GAG GGC GAG GGC GAG GGC CCC GAC GGG GGC GAG CGC AGG AGA AGG 326 0 

Gly Glu Gly Glu Gly Glu Gly Pro Asp Gly Gly Glu Arg Arg Arg Arg 
995 1000 1005 

CAC CGG CAT GGC GCT CCA GCC ACG TAC GAG GGG GAC GCG CGG AGG GAG 3308 
His Arg His Gly Ala Pro Ala Thr Tyr Glu Gly Asp Ala Arg Arg Glu 
1010 1015 1020 

GAC AAG GAG CGG AGG CAT CGG AGG AGG AAA GAG AAC CAG GGC TCC GGG 3356 
Asp Lys Glu Arg Arg His Arg Arg Arg Lys Glu Asn Gin Gly Ser Gly 
1025 1030 1035 1040 

GTC CCT GTG TCG GGC CCC AAC CTG TCA ACC ACC CGG CCA ATC CAG CAG 3404 
Val Pro Val Ser Gly Pro Asn Leu Ser Thr Thr Arg Pro lie Gin Gin 
1045 1050 1055 

GAC CTG GGC CGC CAA GAC CCA CCC CTG GCA GAG GAT ATT GAC AAC ATG 3452 
Asp Leu Gly Arg Gin Asp Pro Pro Leu Ala Glu Asp He Asp Asn Met 
1060 1065 1070 

AAG AAC AAC AAG CTG GCC ACC GCG GAG TCG GCC GCT CCC CAC GGC AGC 3 500 

Lys Asn Asn Lys Leu Ala Thr Ala Glu Ser Ala Ala Pro His Gly Ser 
1075 1080 1085 

CTT GGC CAC GCC GGC CTG CCC CAG AGC CCA GCC AAG ATG GGA AAC AGC 3548 
Leu Gly His Ala Gly Leu Pro Gin Ser Pro Ala Lys Met Gly Asn Ser 
1090 1095 1100 

ACC GAC CCC GGC CCC ATG CTG GCC ATC CCT GCC ATG GCC ACC AAC CCC 3596 
Thr Asp Pro Gly Pro Met Leu Ala He Pro Ala Met Ala Thr Asn Pro 
1105 1110 1115 1120 

CAG AAC GCC GCC AGC CGC CGG ACG CCC AAC AAC CCG GGG AAC CCA TCC 3644 
Gin Asn Ala Ala Ser Arg Arg Thr Pro Asn Asn Pro Gly Asn Pro Ser 
1125 1130 1135 

AAT CCC GGC CCC CCC AAG ACC CCC GAG AAT AGC CTT ATC GTC ACC AAC 3692 
Asn Pro Gly Pro Pro Lys Thr Pro Glu Asn Ser Leu He Val Thr Asn 
1140 1145 1150 

CCC AGC GGC ACC CAG ACC AAT TCA GCT AAG ACT GCC AGG AAA CCC GAC 374 0 

Pro Ser Gly Thr Gin Thr Asn Ser Ala Lys Thr Ala Arg Lys Pro Asp 
1155 1160 1165 

CAC ACC ACA GTG GAC ATC CCC CCA GCC TGC CCA CCC CCC CTC AAC CAC 3788 
His Thr Thr Val Asp He Pro Pro Ala Cys Pro Pro Pro Leu Asn His 
1170 1175 1180 
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^ GTC GTA GTG AA<= AAA AAC GCC AAC CCA GAC CCA CTG CCA AAA 

,fL Val Gln Val Asn Asn A 13 Asn Pro Asp Pro Leu Pro Lys 

1185 1195 1200 

AAA GAG GAA GAG AAG AAG GAG GAG GAG GAA GAC GAC CGT GGG GAA GAC 
Lys Glu Glu Glu Lys Lys Glu Glu Glu Glu Asp Asp Arg Gly Glu Asp 
1205 X210 1215 

GGC CCT AAG CCA ATG CCT CCC TAT AGC TCC ATG TTC ATC CTG TCC ACG 
Gly Pro Lys Pro Met Pro Pro Tyr Ser Ser Met Phe He Leu Ser Thr 
1220 1225 1230 

ACC AAC CCC CTT CGC CGC CTG TGC CAT TAC ATC CTG AAC CTG CGC TAC 
Thr Asn Pro Leu Arg Arg Leu Cys His Tyr He Leu Asn Leu Arg Tyr 
1235 1240 1245 

Phi m TG l GC A ? C CTC ATG GTC ATT GCC ATG AGC AGC ATC GCC CTG 

G ic„ et CyS Ile Leu Met Val Ile Ala Met Ser Ser He Ala Leu 
1250 1255 1260 

GCC GCC GAG GAC CCT GTG CAG CCC AAC GCA CCT CGG AAC AAC GTG CTG 
Ala Ala Glu Asp Pro Val Gin Pro Asn Ala Pro Arg Asn Asn Val Leu 
1265 1270 1275 1280 

CGA TAC TTT GAC TAC GTT TTT ACA GGC GTC TTC ACC TTT GAG ATG GTG 
Arg Tyr Phe Asp Tyr Val Phe Thr Gly Val Phe Thr Phe Glu Met Val 
1285 1290 1295 

ATC AAG ATG ATT GAC CTG GGG CTC GTC CTG CAT CAG GGT GCC TAC TTC 
Ile Lys Met Ile Asp Leu Gly Leu Val Leu His Gin Gly Ala Tyr Phe 
1300 1305 1310 

CGT GAC CTC TGG AAT ATT CTC GAC TTC ATA GTG GTC AGT GGG GCC CTG 
Arg Asp Leu Trp Asn Ile Leu Asp Phe lie Val Val Ser Gly Ala Leu 
1315 1320 1325 

frI A GCC £7 GCC T ? C ACT 000 AAT AGC AAA GGA AAA GAC ATC AAC ACG 
Val Ala Phe Ala Phe Thr Gly Asn Ser Lys Gly Lys Asp Ile Asn Thr 
1330 1335 1340 

ATT AAA TCC CTC CGA GTC CTC CGG GTG CTA CGA CCT CTT AAA ACC ATC 
lie Lys Ser Leu Arg Val Leu Arg Val Leu Arg Pro Leu Lys Thr Ile 
1345 1350 1355 1360 

AAG CGG CTG CCA AAG CTC AAG GCT GTG TTT GAC TGT GTG GTG AAC TCA 
Lys Arg Leu Pro Lys Leu Lys Ala Val Phe Asp Cys Val Val Asn Ser 
1365 1370 1375 

CTT AAA AAC GTC TTC AAC ATC CTC ATC GTC TAC ATG CTA TTC ATG TTC 
Leu Lys Asn Val Phe Asn Ile Leu Ile Val Tyr Met Leu Phe Met Phe 
1380 1385 1390 

ATC TTC GCC GTG GTG GCT GTG CAG CTC TTC AAG GGG AAA TTC TTC CAC 
Ile Phe Ala Val Val Ala Val Gin Leu Phe Lys Gly Lys Phe Phe His 
1395 1400 1405 
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TGC ACT GAC GAG TCC AAA GAG TTT GAG AAA GAT TGT CGA GGC AAA TAC 
Cys Thr Asp Glu Ser Lys Glu Phe Glu Lys Asp Cys Arg Gly Lys Tyr 
1410 1415 1420 

CTC CTC TAC GAG AAG AAT GAG GTG AAG GCG CGA GAC CGG GAG TGG AAG 
Leu Leu Tyr Glu Lys Asn Glu Val Lys Ala Arg Asp Arg Glu Trp Lys 
1425 1430 1435 1440 

AAG TAT GAA TTC CAT TAC GAC AAT GTG CTG TGG GCT CTG CTG ACC CTC 
Lys Tyr Glu Phe His Tyr Asp Asn Val Leu Trp Ala Leu Leu Thr Leu 
1445 1450 1455 

TTC ACC GTG TCC ACG GGA GAA GGC TGG CCA CAG GTC CTC AAG CAT TCG 
Phe Thr Val Ser Thr Gly Glu Gly Trp Pro Gin Val Leu Lys His Ser 
1460 1465 1470 

GTG GAC GCC ACC TTT GAG AAC CAG GGC CCC AGC CCC GGG TAC CGC ATG 
Val Asp Ala Thr Phe Glu Asn Gin Gly Pro Ser Pro Gly Tyr Arg Met 
1475 1480 1485 

GAG ATG TCC ATT TTC TAC GTC GTC TAC TTT GTG GTG TTC CCC TTC TTC 
Glu Met Ser He Phe Tyr Val Val Tyr Phe Val Val Phe Pro Phe Phe 



TTT GTC AAT ATC TTT GTG GCC TTG ATG ATC ATC ACC TTC CAG GAG CAA 
Phe Val Asn He Phe Val Ala Leu He He He Thr Phe Gin Glu Gin 
1505 1510 1515 1520 

GGG GAC AAG ATG ATG GAG GAA TAC AGC CTG GAG AAA AAT GAG AGG GCC 
Gly Asp Lys Met Met Glu Glu Tyr Ser Leu Glu Lys Asn Glu Arg Ala 
1525 1530 1535 

TGC ATT GAT TTC GCC ATC AGC GCC AAG CCG CTG ACC CGA CAC ATG CCG 
Cys lie Asp Phe Ala He Ser Ala Lys Pro Leu Thr Arg His Met Pro 
y 1540 1545 1550 

CAG AAC AAG CAG AGC TTC CAG TAC CGC ATG TGG CAG TTC GTG GTG TCT 
Gin Asn Lys Gin Ser Phe Gin Tyr Arg Met Trp Gin Phe Val Val Ser 
1555 1560 1565 

CCG CCT TTC GAG TAC ACG ATC ATG GCC ATG ATC GCC CTC AAC ACC ATC 
Pro Pro Phe Glu Tyr Thr He Met Ala Met He Ala Leu Asn Thr He 
1570 1575 1580 

GTG CTT ATG ATG AAG TTC TAT GGG GCT TCT GTT GCT TAT GAA AAT GCC 
Val Leu Met Met Lys Phe Tyr Gly Ala Ser Val Ala Tyr Glu Asn Ala 
1585 1590 1595 1S00 

CTG CGG GTG TTC AAC ATC GTC TTC ACC TCC CTC TTC TCT CTG GAA TGT 
Leu Arg Val Phe Asn He Val Phe Thr Ser Leu Phe Ser Leu Glu Cys 
1605 1610 1615 

GTG CTG AAA GTC ATG GCT TTT GGG ATT CTG AAT TAT TTC CGC GAT GCC 
52 22 S5 nil Ala Phe Gly He Leu Asn Tyr Phe Arg Asp Ala 

1620 1625 1630 
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5 SS S E £S SS S BS SSS ^ S 5S 5 S 
2S BE SS SS S S2 S JS SS S SH S S 

1655 1660 

225 2 SS S SS S 2 S S2 2 2 SS S 
SS 2 2 S £2 2 5 2 2 S3SS 15 2 SS 
2 2 SS S3 SS 2 2 S S2 °2 ss ss s £ s 
SS 2 Sj S3 SH S3 2 S? 22 2 2 2 S3 2 2? 

1720 1725 

ss 22 2 sj « 22 ss 2 S SS 2 2 2 

1735 1740 

22 2 2 SS SS SSI 2 ss ss s 2 2 2 s; 

SS S S SS SS S3 SSS 2 S2 £S5 2 55 
SS £S E a? 2 SIS 2 22 ^ 2 ^ see 
SS SS 22 S3 S 2 2 22 2 2 SS 25 SS 

1795 1800 1805 

to 22 S3 S S3 2 K 2 2 SS SS SSS 2 2 

1815 1820 

ler IS tT° t CTG S? C CCC ^ ^ CTG GAT GAG TAC GTG CGT GTC 
Asp Ser Ser He Leu Gly Pro His His Leu Asp Glu Tyr Val Arg Val 

1830 1835 1840 

SS S SS TyS 2p SSS to SS SS SS 2 S3 SSS SS £3 ss 

1845 1850 1855 
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ATG 
Met 


TAT 
Tyr 


CAG 
Gin 


ATG CTG AGA 
Met Leu Arg 
1860 


CAC ATG 
His Met 


TCT CCG 
Ser Pro 
1865 


CCC 
Pro 


CTG GGT CTG GGG AAG 
Leu Gly Leu Gly Lys 
1870 


5852 


AAG 
Lys 


TGT 
Cys 


CCG GCC 
Pro Ala 
1875 


AGA GTG 
Arg val 


GCT TAC AAG 
Ala Tyr Lys 
1880 


CGG 
Arg 


CTT 

Leu 


CTG CGG ATG 
Leu Arg Met 
1885 


GAC 
Asp 


CTG 
Leu 


5900 


CCC 
Pro 


GTC GCA 
Val Ala 
1890 


GAT 
Asp 


GAC AAC 
Asp Asn 


ACC GTC 
Thr Val 
1895 


CAC 
His 


TTC 
Phe 


AAT 
Asn 


TCC ACC 
Ser Thr 
1900 


CTC 
Leu 


ATG 
Met 


GCT 
Ala 


5948 


CTG ATC 
Leu lie 
1905 


CGC 
Arg 


ACA 
Thr 


GCC CTG GAC ATC 
Ala Leu Asp lie 
1910 


AAG 

Lys 


ATT 
lie 


GCC AAG 
Ala Lys 
1915 


GGA 
Gly 


GGA 
Gly 


GCC GAC 
Ala Asp 
1920 


5996 


AAA 


CAG 
Gin 


CAG 
Gin 


ATG 
Met 


GAC GCT 
Asp Ala 
1925 


GAG CTG 
Glu Leu 


CGG 
Arg 


AAG GAG 
Lys Glu 
1930 


ATG 
Met 


ATG 
Met 


GCG 
Ala 


ATT TGG 
lie Trp 
1935 


6044 


CCC 
Pro 


AAT 
Asn 


CTG 
Leu 


TCC CAG AAG 
Ser Gin Lys 
1940 


ACG CTA 
Thr Leu 


GAC CTG 
Asp Leu 
1945 


CTG 
Leu 


GTC 
Val 


ACA 
Thr 


CCT CAC 
Pro His 
1950 


AAG 

Lys 


6092 


TCC 

Ser 


ACG 
Thr 


GAC CTC 
Asp Leu 
1955 


ACC GTG 
Thr Val 


GGG AAG ATC 
Gly Lys lie 
i960 


TAC 
Tyr 


GCA 
Ala 


GCC 
Ala 


ATG ATG 
Met Met 
1965 


ATC 
lie 


ATG 
Met 


6140 


GAG 
Glu 


TAC TAC 
Tyr Tyr 
1970 


CGG 
Arg 


CAG AGC 
Gin Ser 


AAG GCC 
1975 


AAG 
Lys 


AAG 
Lys 


CTG 
Leu 


CAG GCC 
Gin Ala 
1980 


ATG 
Met 


CGC 
Arg 


GAG 
Glu 


6188 


GAG CAG GAC 
Glu Gin Asp 
1985 


CGG 
Arg 


ACA CCC CTC ATG 
Thr Pro Leu Met 
1990 


TTC 
Phe 


CAG 
Gin 


CGC ATG 
Arg Met 
1995 


GAG 
Glu 


CCC 
Pro 


CCG 
Pro 


TCC 
Ser 
2000 


6236 


CCA 
Pro 


ACG 
Thr 


CAG 
Gin 


GAA 
Glu 


GGG GGA CCT GGC 
Gly Gly Pro Gly 
2005 


CAG 
Gin 


AAC GCC 
Asn Ala 
2010 


CTC 
Leu 


CCC 
Pro 


TCC 

Ser 


ACC CAG 
Thr Gin 
2015 


6284 


CTG GAC 
Leu Asp 


CCA 
Pro 


GGA GGA GCC 
Gly Gly Ala 
2020 


CTG ATG 
Leu Met 


GCT CAC 
Ala His 
2025 


GAA 
Glu 


AGC 
Ser 


GGC 
Gly 


CTC AAG 
Leu Lys 
2030 


GAG 
Glu 


6332 


AGC 
Ser 


CCG 
Pro 


TCC 
Ser 


TGG GTG ACC 
Trp Val Thr 


CAG CGT 
Gin Arg 


GCC 
Ala 


CAG 
Gin 


GAG 
Glu 


ATG 
Met 


TTC 
Phe 


CAG 
Gin 


AAG 
Lys 


ACG 
Thr 


6380 



GGC ACA TGG AGT CCG GAA CAA GGC CCC CCT ACC GAC ATG CCC AAC AGC 
Gly Thr Trp Ser Pro Glu Gin Gly Pro Pro Thr Asp Met Pro Asn Ser 
2050 2055 2060 

CAG CCT AAC TCT CAG TCC GTG GAG ATG CGA GAG ATG GGC AGA GAT GGC 
Gin Pro Asn Ser Gin Ser Val Glu Met Arg Glu Met Gly Arg Asp Gly 
2065 2070 2075 2080 
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Sr Ser SS !S S2 ^ CT ° CCC ATG GAA GGC CAG GGC CGG G " 
Tyr Ser Asp Ser Glu His Tyr Leu Pro Met Glu Gly Gin Gly Arg Ala 

2085 2090 2095 

AS IS SI? £2! CGC t CTC CCT GCA GAG ^ ^ AGG AGA AGG GGC CGG 
Ala Ser Met Pro Arg Leu Pro Ala Glu Asn Gin Arg Arg Arg Gly Arg 
0 2105 2110 

Pro* S G G ? A^n ^ r CTC AGT ACC ATC TCA GAC ACC AGG CCC ATG AAG 
Pro Arg Gly Asn Asn Leu Ser Thr He Ser Asp Thr Ser Pro Met Lys 
" iJ " Lb 2120 2125 

Ser lli sfr v T ? t CTG GGC CCC *** GCC CGA CGC CTG GAC GAT TAC 
S?fo V Pr ° LyS Ala ^ ^3 Leu Asp Asp Tyr 

2135 2140 

lit 2S flu £22 v T ° » CG CCC GAG GAG ^ CAG CGG CAC CAC CAG CGG 
Ser Leu Glu Arg Val Pro Pro Glu Glu Asn Gin Arg His His Gin Arg 
2150 2155 2160 

Sa CGC o GC CGC GCC TCT GAG CGG TCC CTG GGC CGC TAC 

Arg Arg Asp Arg Ser His Arg Ala Ser Glu Arg Ser Leu Gly Arg Tyr 
2165 2170 2175 

£ CC . GAT S"? GAC ACA GGC 77(5 GSG ACA GAC CTG AGC ATG ACC ACC CAA 
Thr Asp val Asp Thr Gly Leu Gly Thr Asp Leu SeJ nit Thr G^ 
2180 2185 2190 

lor n AC t CTG CCG TCG ^ GAG CGG GAC CAG GAG CGG GGC CGG CCC 

Ser Gly Asp Leu Pro Ser Lys Glu Arg Asp Gin Glu Arg Gly Ar? Pro 
2195 2200 2205 

AAG GAT CGG AAG CAT CGA CAG CAC CAC CAC CAC CAC CAC CAC CAC CAC 
Lys Asp Arg Lys His Arg Gin His His His 2S £s £s 2s 2S 
^^■ LU 2215 2220 

SI So Pro p CG CCC f* 0 GAC CGC TAT GCC CAG GAA CGG CCG GAC 

Pro Pro Pro Asp Lys Asp Arg Tyr Ala Gin Glu Arg Pro Asp 

2230 2235 22 40 

His i?v f ^ CGG GCT CGG GAC CGC TGG TCC CGC TCG CCC AGC 

His Gly Arg Ala Arg Ala Arg Asp Gin Arg Trp Ser Arg Ser Pro Ser 
2245 2250 2 255 

G AG ggc S 2S S s c ffi S S c S S tagttccgta ag * ggaaggg 

2260 2265 
CAGCCCCCTC AACATCTGGT ACCAGCACTC CGCGGCGGGG CCGCCGCCAG CTCCCCCAGA 
CCCCCTCCAC CCCCCGGCCA CACGTGTCCT ATTCCCCTGT GATCCGTAAG GCCGGCGGCT 
CGGGGCCCCC GCAGCAGCAG CAGCAGCAGC AGGCGGTGGC CAGGCCGGGC CGGGCGGCCA 
CCAGCGGCCC TCGGAGGTAC CCAGGCCCCA CGGCCGAGCC TCTGGCCGGA GATCGGCCGC 



7114 
7174 
7234 
7294 
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CCACGGGGGG 


CGACAGCAGC 


GGCCGCTCGC 


CCAGGATGGA 


GAGGCGGGTC 


CCAGGCCCGG 


7354 


CCCGGAGCGA 


GTCCCCCAGG 


GCCTGTCGAC 


ACGGCGGGGC 


CCGGTGGCCG 


GCATCTGGCC 


7414 


CGCACGTGTC 


CGAGGGGCCC 


CCGGGTCCCC 


GGCACCATGG 


CTACTACCGG 


GGCTCCGACT 


7474 


ACGACGAGGC 


CGATGGCCCG 


GGCAGCGGGG 


GCGGCGAGGA 


GGCCATGGCC 


GGGGCCTACG 


7534 


ACGCGCCACC 


CCCCGTACGA CACGCGTCCT 


CGGGCGCCAC 


CGGGCGCTCG 


CCCAGGACTC 


7594 


CCCGGGCCTC 


GGGCCCGGCC 


TGCGCCTCGC 


CTTCTCGGCA CGGCCGGCGA CTCCCCAACG 


7654 


GCTACTACCC 


GGCGCACGGA 


CTGGCCAGGC 


CCCGCGGGCC 


GGGCTCCAGG 


AAGGGCCTGC 


7714 


ACGAACCCTA 


CAGCGAGAGT 
ACGCACC 


GACGATGATT 


GGTGCTAAGC 


CCGGGCGAGG 


TGGCGCCCGC 


7774 
7791 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7032 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 166.. 6921 

(D) OTHER INFORMATION: /standard_name= "Alpha- IE- 1" 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

GCTGCTGCTG CCTCTCCGAA GAGCTCGCGG AGCTCCCCAG AGGCGGTGGT CCCCGTGCTT 

GTCTGGATGC GGCTCTGAGT CTCCGTGTGT CTTTCTGCTT GTTGCTGTGT GCGGGTGTTC 

GGCCGCGATC AC CTTTGTGT GTCTTCTGTC TGTTTAAACC TCAGG ATG GCT CGC 

Met Ala Arg 




222 




270 




318 
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TTG TAC AAC CCC ATT CCC GTC CGG CAG AAC TGT TTC ACC GTC AAC AGA 
Leu Tyr Asn Pro lie Pro Val Arg Gin Asn Cys Phe Thr Val Asn Arq 
55 60 65 

TCC CTG TTC ATC TTC GGA GAA GAT AAC ATT GTC AGG AAA TAT GCC AAG 
Ser Leu Phe lie Phe Gly Glu Asp Asn lie Val Arg Lys Tyr Ala Lys 
70 75 80 

AAG CTC ATC GAT TGG CCG CCA TTT GAG TAC ATG ATC CTG GCC ACC ATC 
Lys Leu lie Asp Trp Pro Pro Phe Glu Tyr Met lie Leu Ala Thr lie 
85 go 95 

ATT GCC AAC TGC ATC GTC CTG GCC CTG GAG CAG CAT CTT CCT GAG GAT 
lie Ala Asn Cys He Val Leu Ala Leu Glu Gin His Leu Pro Glu Asp 
100 105 110 X1 f 

GAC AAG ACC CCC ATG TCC CGA AGA CTG GAG AAG ACA GAA CCT TAT TTC 
Asp Lys Thr Pro Met Ser Arg Arg Leu Glu Lys Thr Glu Pro Tyr Phe 
120 125 130 

ATT GGG ATC TTT TGC TTT GAA GCT GGG ATC AAA ATT GTG GCC CTG GGG 
He Gly He Phe Cys Phe Glu Ala Gly He Lys He Val Ala Leu Gly 
135 140 " 145 

TTC ATC TTC CAT AAG GGC TCT TAC CTC CGC AAT GGC TGG AAT GTC ATG 
Phe He Phe His Lys Gly Ser Tyr Leu Arg Asn Gly Trp Asn Val Met 
150 i 5 5 ~ 160 

GAC TTC ATC GTG GTC CTC AGT GGC ATC CTG GCC ACT GCA GGA ACC CAC 
Asp Phe He Val Val Leu Ser Gly He Leu Ala Thr Ala Gly Thr His 
l 65 170 175 

TTC AAT ACT CAC GTG GAC CTG AGG ACC CTC CGG GCT GTG CGT GTC CTG 
Phe Asn Thr His Val Asp Leu Arg Thr Leu Arg Ala Val Arg Val Leu 
180 185 190 i 9 5 

CGG CCT TTG AAG CTC GTG TCA GGG ATA CCT AGC CTG CAG ATT GTG TTG 
Arg Pro Leu Lys Leu Val Ser Gly He Pro Ser Leu Gin He Val Leu 
200 205 210 

AAG TCC ATC ATG AAG GCC ATG GTA CCT CTT CTG CAG ATT GGC CTT CTG 
Lys Ser He Met Lys Ala Met Val Pro Leu Leu Gin He Gly Leu Leu 
215 220 225 

CTC TTC TTT GCC ATC CTG ATG TTT GCT ATC ATT GGT TTG GAG TTC TAC 
Leu Phe Phe Ala He Leu Met Phe Ala He He Gly Leu Glu Phe Tyr 
230 235 240 * 

AGT GGC AAG TTA CAT CGA GCG TGC TTC ATG AAC AAT TCA GGT ATT CTA 
Ser Gly Lys Leu His Arg Ala Cys Phe Met Asn Asn Ser Gly He Leu 
245 250 255 

GAA GGA TTT GAC CCC CCT CAC CCA TGT GGT GTG CAG GGC TGC CCA GCT 
Glu Gly Phe Asp Pro Pro His Pro Cys Gly Val Gin Gly Cys Pro Ala 
260 265 270 275 
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GGT TAT GAA TGC AAG GAC TGG ATC GGC CCC AAT GAT GGG ATC ACC CAG 
Gly Tyr Glu Cys Lys Asp Trp He Gly Pro Asn Asp Gly He Thr Gin 
280 2B5 290 

TTT GAT AAC ATC CTT TTT GCT GTG CTG ACT GTC TTC CAG TGC ATC ACC 
Phe Asp Asn He Leu Phe Ala Val Leu Thr Val Phe Gin Cys He Thr 
295 300 305 

ATG GAA GGG TGG ACC ACT GTG CTG TAC AAT ACC AAT GAT GCC TTA GGA 
Met Glu Gly Trp Thr Thr Val Leu Tyr Asn Thr Asn Asp Ala Leu Gly 
310 315 320 

GCC ACC TGG AAT TGG CTG TAC TTC ATC CCC CTC ATC ATC ATT GGA TCC 
Ala Thr Trp Asn Trp Leu Tyr Phe He Pro Leu He He He Gly Ser 
325 330 335 

TTC TTT GTT CTC AAC CTA GTC CTG GGA GTG CTT TCC GGG GAA TTT GCC 
Phe Phe Val Leu Asn Leu Val Leu Gly Val Leu Ser Gly Glu Phe Ala 
340 345 350 355 

AAA GAG AGA GAG AGA GTG GAG AAC CGA AGG GCT TTC ATG AAG CTG CGG 
Lys Glu Arg Glu Arg Val Glu Asn Arg Arg Ala Phe Met Lys Leu Arg 
360 365 370 

CGC CAG CAG CAG ATT GAG CGT GAG CTG AAT GGC TAC CGT GCC TGG ATA 
Arg Gin Gin Gin He Glu Arg Glu Leu Asn Gly Tyr Arg Ala Trp He 
375 380 385 

GAC AAA GCA GAG GAA GTC ATG CTC GCT GAA GAA AAT AAA AAT GCT GGA 
Asp Lys Ala Glu Glu Val Met Leu Ala Glu Glu Asn Lys Asn Ala Gly 
390 395 400 

ACA TCC GCC TTA GAA GTG CTT CGA AGG GCA ACC ATC AAG AGG AGC CGG 
Thr Ser Ala Leu Glu Val Leu Arg Arg Ala Thr He Lys Arg Ser Arg 
405 410 415 

ACA GAG GCC ATG ACT CGA GAC TCC AGT GAT GAG CAC TGT GTT GAT ATC 
Thr Glu Ala Met Thr Arg Asp Ser Ser Asp Glu His Cys Val Asp He 
420 425 430 435 

TCC TCT GTG GGC ACA CCT CTG GCC CGA GCC AGT ATC AAA AGT GCA AAG 
Ser Ser Val Gly Thr Pro Leu Ala Arg Ala Ser He Lys Ser Ala Lys 
440 445 450 

GTA GAC GGG GTC TCT TAT TTC CGG CAC AAG GAA AGG CTT CTG CGC ATC 
Val Asp Gly Val Ser Tyr Phe Arg His Lys Glu Arg Leu Leu Arg He 
455 460 465 

TCC ATT CGC CAC ATG GTT AAA TCC CAG GTG TTT TAC TGG ATT GTG CTG 
Ser He Arg His Met Val Lys Ser Gin Val Phe Tyr Trp He Val Leu 
470 475 480 

AGC CTT GTG GCA CTC AAC ACT GCC TGT GTG GCC ATT GTC CAT CAC AAC 
Ser Leu Val Ala Leu Asn Thr Ala Cys Val Ala He Val His His Asn 
485 490 495 
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CAG CCC CAG TGG CTC ACC CAC CTC CTC TAC TAT GCA GAA TTT CTG TTT 
Gin Pro Gin Trp Leu Thr His Leu Leu Tyr Tyr Ala gTS £1 Leu Se 
S00 505 510 515 

22 T° TC r TC J TG GAG ATG TCC CTG ATG TAT GGC ATG GGG 

Leu Gly Leu Phe Leu Leu Glu Met Ser Leu Lys Met Tyr Gly Met Glv 
520 525 / 530 * 

CCT CGC CTT TAT TTT CAC TCT TCA TTC AAC TGC TTT GAT TTT GGG GTC 
Pro Arg Leu Tyr Phe His Ser Ser Phe Asn Cys Phe Asp Phe Gly Sal 
535 540 545 

tS S2 G?v Kvl tT C ir ^ ?, TG GTC TGG GCA ATC TTC AGA CC * GGT 
Thr Val Gly Ser He Phe Glu Val Val Trp Ala He Phe Arg Pro Gly 

= 50 555 560 

Th? Jer Phi i?5 t?° q GT f, TC CGA GCC CTC CGG CTT CTA AGA ATA 

Thr ser Phe Gly He Ser Val Leu Arg Ala Leu Arg Leu Leu Arg He 
565 570 575 

52 t? A tu C ™ TGG GCT TCC CTA CGG AAT TTG GTG GTC TCC 

sao 7 116 Thr Lys ^ Trp 7113 Ser Leu ^9 ten Leu Val Val Ser 
580 585 590 595 

Jlf, m TG o GC ^ ATG **° TCT ATC ATC AG * TTG CTT TTC CTC CTC TTC 
Leu Met Ser Ser Met Lys Ser He He Ser Leu Leu Phe Leu Leu Se 
600 605 610 

CTC TTC ATC GTT GTC TTT GCT CTC CTA GGA ATG CAG TTA TTT GGA GGC 
Leu Phe He Val Val Phe Ala Leu Leu Gly Met gYh Leu S3 tS.y 
615 620 625 

AGG TTT AAC TTT AAT GAT GGG ACT CCT TCG GCA AAT TTT GAT ACC TTC 
Arg Phe Asn Phe Asn Asp Gly Thr Pro Ser Ala Asn Phe 2p J£ Se 
630 635 640 

GC ° A T C ATG ACT GTG TTC ^ ATC CTG ACG GGT GAG GAC TGG 
Pro Ala Ala He Met Thr Val Phe Gin He Leu Thr Gly Glu Asp Trp 
645 650 655 

AAT GAG GTG ATG TAC AAT GGG ATC CGC TCC CAG GGT GGG GTC AGC TCA 
Asn Glu Val Met Tyr Asn Gly He Arg Ser Gin Gly Gly Val Ser Ser 
660 66 5 670 675 

r?5 £ TG - 1°° I CT GCC ATC TAC TTC ATT GTG CTC ACC ^TG TTT GGC AAC 
Gly Met Trp Ser Ala He Tyr Phe He Val Leu Thr Leu Phe Gly Asn 
680 685 690 

TAC ACG CTA CTG AAT GTG TTC TTG GCT ATC GCT GTG GAT AAT CTC GCC 
Tyr Thr Leu Leu Asn Val Phe Leu Ala He Ala Val Asp Asn Leu Ala 
695 700 705 

AAC GCC CAG GAR CTG ACC AAG GAT GAA CAG GAG GAA GAA GAG GCC TTC 
Asn Ala Gin Glu Leu Thr Lys Asp Glu Gin Glu Glu Glu Glu Ala Phe 
710 715 720 
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AAC CAG AAA CAT GCA CTG CAG AAG GCC AAG GAG GTC AGC CCG ATG TCT 
Asn Gin Lys His Ala Leu Gin Lys Ala Lys Glu Val Ser Pro Met Ser 
725 730 735 

GCA CCC AAC ATG CCT TCG ATC GAG AGG GAG CGG AGG CGC CGG CAC CAC 
Ala Pro Asn Met Pro Ser He Glu Arg Glu Arg Arg Arg Arg His His 
740 745 750 755 

ATG TCC GTG TGG GAG CAG CGT ACC AGC CAG CTG AGG AAG CAC ATG CAG 
Met Ser Val Trp Glu Gin Arg Thr Ser Gin Leu Arg Lys His Met Gin 
760 765 770 

ATG TCC AGC CAG GAG GCC CTC AAC AGA GAG GAG GCG CCG ACC ATG AAC 
Met Ser Ser Gin Glu Ala Leu Asn Arg Glu Glu Ala Pro Thr Met Asn 
775 780 785 

CCG CTC AAC CCC CTC AAC CCG CTC AGC TCC CTC AAC CCG CTC AAT GCC 
Pro Leu Asn Pro Leu Asn Pro Leu Ser Ser Leu Asn Pro Leu Asn Ala 
790 795 800 

CAC CCC AGC CTT TAT CGG CGA CCC AGG GCC ATT GAG GGC CTG GCC CTG 
His Pro Ser Leu Tyr Arg Arg Pro Arg Ala He Glu Gly Leu Ala Leu 
805 810 815 

GGC CTG GCC CTG GAG AAG TTC GAG GAG GAG CGC ATC AGC CGT GGG GGG 
Gly Leu Ala Leu Glu Lys Phe Glu Glu Glu Arg He Ser Arg Gly Gly 
820 825 830 835 

TCC CTC AAG GGG GAT GGA GGG GAC CGA TCC AGT GCC CTG GAC AAC CAG 
Ser Leu Lys Gly Asp Gly Gly Asp Arg Ser Ser Ala Leu Asp Asn Gin 
840 845 850 

AGG ACC CCT TTG TCC CTG GGC CAG CGG GAG CCA CCA TGG CTG GCC AGG 
Arg Thr Pro Leu Ser Leu Gly Gin Arg Glu Pro Pro Trp Leu Ala Arg 
855 860 865 

CCC TGT CAT GGA AAC TGT GAC CCG ACT CAG CAG GAG GCA GGG GGA GGA 
Pro Cys His Gly Asn Cys Asp Pro Thr Gin Gin Glu Ala Gly Gly Gly 
870 875 880 

GAG GCT GTG GTG ACC TTT GAG GAC CGG GCC AGG CAC AGG CAG AGC CAA 
Glu Ala Val Val Thr Phe Glu Asp Arg Ala Arg His Arg Gin Ser Gin 
885 890 895 

CGG CGC AGC CGG CAT CGC CGC GTC AGG ACA GAA GGC AAG GAG TCC TCT 
Arg Arg Ser Arg His Arg Arg Val Arg Thr Glu Gly Lys Glu Ser Ser 
900 905 910 915 

TCA GCC TCC CGG AGC AGG TCT GCC AGC CAG GAA CGC AGT CTG GAT GAA 
Ser Ala Ser Arg Ser Arg Ser Ala Ser Gin Glu Arg Ser Leu Asp Glu 
920 925 930 

GCC ATG CCC ACT GAA GGG GAG AAG GAC CAT GAG CTC AGG GGC AAC CAT 
Ala Met Pro Thr Glu Gly Glu Lys Asp His Glu Leu Arg Gly Asn His 
935 940 945 
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GGT GCC AAG GAG CCA ACG ATC CAA GAA GAG AGA GCC CAG GAT TTA AGG 
Gly Ala Lys Glu Pro Thr He Gin Glu Glu Arg Ala Gin Asp Leu Arg 
950 955 960 

AGG ACC AAC AGT CTG ATG GTG TCC AGA GGC TCC GGG CTG GCA GGA GGC 
Arg Thr Asn Ser Leu Met Val Ser Arg Gly Ser Gly Leu Ala Gly Gly 
965 970 975 

CTT GAT GAG GCT GAC ACC CCC CTA GTC CTG CCC CAT CCT GAG CTG GAA 
Leu Asp Glu Ala Asp Thr Pro Leu Val Leu Pro His Pro Glu Leu Glu 
980 985 990 995 

GTC GGG AAG CAC GTG GTG CTG ACG GAG CAG GAG CCA GAA GGC AGC AGT 
Val Gly Lys His Val Val Leu Thr Glu Gin Glu Pro Glu Gly Ser Ser 
1000 1005 1010 

GAG CAG GCC CTG CTG GGG AAT GTG CAG CTA GAC ATG GGC CGG GTC ATC 
Glu Gin Ala Leu Leu Gly Asn Val Gin Leu Asp Met Gly Arg Val He 
1015 1020 1025 

AGC CAG AGC GAG CCT GAC CTC TCC TGC ATC ACG GCC AAC ACG GAC AAG 
Ser Gin Ser Glu Pro Asp Leu Ser Cys He Thr Ala Asn Thr Asp Lys 
1030 1035 1040 

GCC ACC ACC GAG AGC ACC AGC GTC ACC GTC GCC ATC CCC GAC GTG GAC 
Ala Thr Thr Glu Ser Thr Ser Val Thr Val Ala He Pro Asp Val Asp 
1050 1055 

CCC TTG GTG GAC TCA ACC GTG GTG CAC ATT AGC AAC AAG ACG GAT GGG 
Pro Leu Val Asp Ser Thr Val Val His He Ser Asn Lys Thr Asp Gly 
1060 1065 1070 1075 

GAA GCC AGT CCC TTG AAG GAG GCA GAG ATC AGA GAG GAT GAG GAG GAG 
Glu Ala Ser Pro Leu Lys Glu Ala Glu He Arg Glu Asp Glu Glu Glu 
1080 1085 1090 

GTG GAG AAG AAG AAG CAG AAG AAG GAG AAG CGT GAG ACA GGC AAA GCC 
Val Glu Lys Lys Lys Gin Lys Lys Glu Lys Arg Glu Thr Gly Lys Ala 
1095 iioo H05 

ATG GTG CCC CAC AGC TCA ATG TTC ATC TTC AGC ACC ACC AAC CCG ATC 
Met Val Pro His Ser Ser Met Phe He Phe Ser Thr Thr Asn Pro He 
1110 1115 1120 

CGG AGG GCC TGC CAC TAC ATC GTG AAC CTG CGC TAC TTT GAG ATG TGC 
Arg Arg Ala Cys His Tyr He Val Asn Leu Arg Tyr Phe Glu Met Cys 
1125 H30 H35 

ATC CTC CTG GTG ATT GCA GCC AGC AGC ATC GCC CTG GCG GCA GAG GAC 
He Leu Leu Val He Ala Ala Ser Ser He Ala Leu Ala Ala Glu Asp 
1140 1145 H50 H55 

CCC GTC CTG ACC AAC TCG GAG CGC AAC AAA GTC CTG AGG TAT TTT GAC 
Pro Val Leu Thr Asn Ser Glu Arg Asn Lys Val Leu Arg Tyr Phe Asp 
1160 H65 1170 
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TAT GTG TTC ACG GGC GTG TTC ACC TTT GAG ATG GTT ATA AAG ATG ATA 3726 
Tyr Val Phe Thr Gly Val Phe Thr Phe Glu Met Val He Lys Met He 
1175 1180 1185 

GAC CAA GGC TTG ATC CTG CAG GAT GGG TCC TAC TTC CGA GAC TTG TGG 3774 
Asp Gin Gly Leu He Leu Gin Asp Gly Ser Tyr Phe Arg Asp Leu Trp 
1190 1195 1200 

AAC ATC CTG GAC TTT GTG GTG GTC GTT GGC GCA TTG GTG GCC TTT GCT 3 822 

Asn He Leu Asp Phe Val Val Val Val Gly Ala Leu Val Ala Phe Ala 
1205 ...1210 1215 

CTG GCG AAC GCT TTG GGA ACC AAC AAA GGA CGG GAC ATC AAG ACC ATC 3 870 

Leu Ala Asn Ala Leu Gly Thr Asn Lys Gly Arg Asp He Lys Thr He 
1220 1225 1230 1235 

AAG TCT CTG CGG GTG CTC CGA GTT CTA AGG CCA CTG AAA ACC ATC AAG 3 916 

Lys Ser Leu Arg Val Leu Arg Val Leu Arg Pro Leu Lys Thr He Lys 
1240 1245 1250 

CGC TTG CCC AAG CTC AAG GCC GTC TTC GAC TGC GTA GTG ACC TCC TTG 3966 
Arg Leu Pro Lys Leu Lys Ala Val Phe Asp Cys Val Val Thr Ser Leu 
1255 1260 1265 

AAG AAT GTC TTC AAC ATA CTC ATT GTG TAC AAG CTC TTC ATG TTC ATC 4014 
Lys Asn Val Phe Asn He Leu He Val Tyr Lys Leu Phe Met Phe He 
1270 1275 1280 

TTT GCT GTC ATC GCA GTT CAG CTC TTC AAG GGA AAG TTC TTT TAT TGC 4062 
Phe Ala Val He Ala Val Gin Leu Phe Lys Gly Lys Phe Phe Tyr Cys 
1285 1290 1295 

ACG GAC AGT TCC AAG GAC ACA GAG AAG GAG TGC ATA GGC AAC TAT GTA 4110 
Thr Asp Ser Ser Lys Asp Thr Glu Lys Glu Cys He Gly Asn Tyr Val 
1300 1305 1310 1315 

GAT CAC GAG AAA AAC AAG ATG GAG GTG AAG GGC CGG GAA TGG AAG CGC 4158 
Asp His Glu Lys Asn Lys Met Glu Val Lys Gly Arg Glu Trp Lys Arg 
1320 1325 1330 

CAT GAA TTC CAC TAC GAC AAC ATT ATC TGG GCC CTG CTG ACC CTC TTC 4206 
His Glu Phe His Tyr Asp Asn He He Trp Ala Leu Leu Thr Leu Phe 
1335 1340 1345 

ACC GTC TCC ACA GGG GAA GGA TGG CCT CAA GTT CTG CAG CAC TCT GTA 4254 
Thr Val Ser Thr Gly Glu Gly Trp Pro Gin Val Leu Gin His Ser Val 
1350 * 1355 1360 

GAT GTG ACA GAG GAA GAC CGA GGC CCA AGC CGC AGC AAC CGC ATG GAG 4302 
Asp Val Thr Glu Glu Asp Arg Gly Pro Ser Arg Ser Asn Arg Met Glu 
1365 1370 1375 

ATG TCT ATC TTT TAT GTA GTC TAC TTT GTG GTC TTC CCC TTC TTC TTT 4350 
Met Ser He Phe Tyr Val Val Tyr Phe Val Val Phe Pro Phe Phe Phe 
1380 1385 1390 1395 
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GTC AAT ATC TTT GTG GCT CTC ATC ATC ATC ACC TTC CAG GAG CAA GGG 
Val Asn lie Phe Val Ala Leu He He He Thr Phe Gin Glu Gin Gly 
1400 1405 1410 

GAT AAG ATG ATG GAG GAG TGC AGC CTG GAG AAG AAT GAG AGG GCG TGC 
Asp Lys Met Met Glu Glu Cys Ser Leu Glu Lys Asn Glu Arg Ala Cvs 
1415 1420 1425 

xTf ? AC GCC A T C AGC GCC *** CCT CTC ACC CGC TAC ATG CCG CAG 

He Asp Phe Ala He Ser Ala Lys Pro Leu Thr Arg Tyr Met Pro Gin 
1430 143 5 144Q 

AAC AGA CAC ACC TTC CAG TAC CGC GTG TGG CAC TTT GTG GTG TCT CCG 

?f? " 1S Thr Phe Gln TV* ^9 Val Tr P Hi s Phe Val Val Ser Pro 
1445 1450 1455 

cf S ll T S AG I AC ACC ATT ATG GCC ATG ATC GCC TTG AAT ACT GTT GTG 
Ser Phe Glu Tyr Thr He Met Ala Met He Ala Leu Asn Thr Val Val 
1460 1465 1470 1475 

CTG ATG ATG AAG TAT TAT TCT GCT CCC TGT ACC TAT GAG CTG GCC CTG 
Leu Met Met Lys Tyr Tyr Ser Ala Pro Cys Thr Tyr Glu Leu Ala Leu 
1480 1485 1490 

AAG TAC CTG AAT ATC GCC TTC ACC ATG GTG TTT TCC CTG GAA TGT GTC 
Lys Tyr Leu Asn He Ala Phe Thr Met Val Phe Ser Leu Glu Cys Val 
1495 1500 1505 

CTG AAG GTC ATC GCT TTT GGC TTT TTG AAC TAT TTC CGA GAC ACC TGG 
Leu Lys Val He Ala Phe Gly Phe Leu Asn Tyr Phe Arg Asp Thr Tro 
151° 1515 ' 1520 

AAT ATC TTT GAC TTC ATC ACC GTG ATT GGC AGT ATC ACA GAA ATT ATC 
Asn He Phe Asp Phe He Thr Val He Gly Ser He Thr Glu He He 
1525 1530 1535 

CTG ACA GAC AGC AAG CTG GTG AAC ACC AGT GGC TTC AAT ATG AGC TTT 
Leu Thr Asp Ser Lys Leu Val Asn Thr Ser Gly Phe Asn Met Ser Phe 
1540 1545 1550 1555 

CTG AAG CTC TTC CGA GCT GCC CGC CTC ATA AAG CTC CTG CGT CAG GGC 
Leu Lys Leu Phe Arg Ala Ala Arg Leu He Lys Leu Leu Arg Gin Gly 
1560 1565 1570 

TAT ACC ATA CGC ATT TTG CTG TGG ACC TTT GTG CAG TCC TTT AAG GCC 
Tyr Thr He Arg He Leu Leu Trp Thr Phe Val Gin Ser Phe Lys Ala 
1575 1580 1585 

CTC CCT TAT GTC TGC CTT TTA ATT GCC ATG CTT TTC TTC ATT TAT GCC 
Leu Pro Tyr Val Cys Leu Leu He Ala Met Leu Phe Phe He Tyr Ala 
1590 1595 1600 

ATC ATT GGG ATG CAG GTA TTT GGA AAC ATA AAA TTA GAC GAG GAG AGT 
He He Gly Met Gin Val Phe Gly Asn He Lys Leu Asp Glu Glu Ser 
1605 1610 1615 
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CAC 
His 
162C 


ATC 
He 


AAC CGG 
Asn Arg 


CAC 
His 


AAC AAC TTC CGG 
Asn Asn Phe Arg 
1625 


AGT 
Ser 


TTC TTT 
Phe Phe 
1630 


GGG TCC CTA 
Gly Ser Leu 


ATG 
Met 
1635 


CTA 
Leu 


CTC 
Leu 


TTC AGG 
Phe Arg 


AGT GCC 
Ser Ala 
1640 


ACA GGT GAG GCC TGG 
Thr Gly Glu Ala Trp 
1645 


CAG 
Gin 


GAG 
Glu 


ATT 
He 


ATG CTG 
Met Leu 
1650 


TCA 
Ser 


TGC 
Cys 


CTT GGG GAG 
Leu Gly Glu 
1655 


AAG 
Lys 


GGC TGT 
Gly Cys 


GAG CCT 
Glu Pro 
1660 


GAC 
Asp 


ACC 
Thr 


ACC 
Thr 


GCA CCA 
Ala Pro 
1665 


TCA 
Ser 


GGG 
Gly 


CAG 
Gin 


AAC GAG 
Asn Glu 
1670 


AAT 
Asn 


GAA 
Glu 


CGC TGC GGC 
Arg Cys Gly 
1675 


ACC 
Thr 


GAT 
Asp 


CTG 
Leu 


GCC TAC 
Ala Tyr 
1680 


GTG 
Val 


TAC 
Tyr 


TTT 
Phe 


GTC TCC TTC 
Val Ser Phe 
1685 


ATC 
He 


TTC 
Phe 


TTC TGC 
Phe Cys 
1690 


TCC 

Ser 


Phe 


TTG 
Leu 


ATG CTC 
Met Leu 
1695 


AAC 
Asn 


CTG 
Leu 


Phe 


GTG GCC 
Val Ala 
1700 


GTC ATC 
Val He 


ATG 
Met 


GAC AAC TTT 
Asp Asn Phe 
1705 


GAG 
Glu 


TAC 
Tyr 


CTG ACT CGG 
Leu Thr Arg 
1710 


GAC 
Asp 


TCC 
Ser 


TCC 
Ser 
1715 


ATC 
He 


CTG 
Leu 


GGG CCT 
Gly Pro 


CAC CAC 
His His 
1720 


TTG GAC 
Leu Asp 


GAG 
GlU 


TTT GTC 
Phe Val 
1725 


CGC 
Arg 


GTC 
val 


TGG 
Trp 


GCA GAA 
Ala Glu 
1730 


TAT 
Tyr 


GAC 
Asp 


CGA GCA GCA 
Arg Ala Ala 
1735 


TGT 
Cys 


GGC CGC 
Gly Arg 


ATC CAT 
He His 
1740 


TAC 
Tyr 


ACT 
Thr 


GAG 
Glu 


ATG TAT 
Met Tyr 
1745 


GAA 
Glu 


ATG 
Met 


CTG 
Leu 


ACT CTC 
Thr Leu 
1750 


ATG 
Met 


TCA 
Ser 


CCT CCG CTA 
Pro Pro Leu 
1755 


GGC 
Gly 


CTC 
Leu 


GGC AAG AGA 
Gly Lys Arg 
1760 


TGT 
Cys 


CCC 
Pro 


TCC 
Ser 


AAA GTG GCA 
Lys Val Ala 
1765 


TAT 
Tyr 


AAG 
Lys 


AGG TTG 
Arg Leu 
1770 


GTC 
Val 


CTG 


ATG 
Met 


AAC ATG 
Asn Met 
1775 


CCA 

Pro 


GTA 
Val 


GCT 
Ala 


GAG GAC 
Glu Asp 
1780 


ATG ACG 
Met Thr 


GTC 
val 


CAC TTC ACC 
His Phe Thr 
1785 


TCC 
Ser 


ACA 
Thr 


CTT ATG 
Leu Met 
1790 


GCT 
Ala 


CTG 
Leu 


ATC 
He 


CGG 
Arg 
1795 


ACA 
Thr 


GCT 
Ala 


CTG GAC 
Leu Asp 


ATT AAA 
He Lys 
1800 


ATT GCC 
He Ala 


AAA 
Lys 


GGT GGT GCA 
Gly Gly Ala 
1805 


GAC AGG 
Asp Arg 


CAG CAG 
Gin Gin 
1810 


CTA GAC 
Leu Asp 


TCA GAG CTA 
Ser Glu Leu 
1815 


CAA 
Gin 


AAG GAG 
Lys Glu 


ACC CTA 
Thr Leu 
1820 


GCC 
Ala 


ATC 
He 


TGG 
Trp 


CCT CAC 
Pro His 
1825 


CTA 
Leu 



TCC CAG AAG ATG CTG GAT CTG CTT GTG CCC ATG CCC AAA GCC TCT GAC 
Ser Gin Lys Met Leu Asp Leu Leu Val Pro Met Pro Lys Ala Ser Asp 
1830 1835 1840 
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CTG ACT GTG GGC AAA ATC TAT GCA GCA ATG ATG ATC ATG GAC TAC TAT 
Thr Val Gly Lys He Tyr Ala Ala Met Met He nli Sp Ty^ Ty^ 
1845 1850 1855 

AAG CAG AGT AAG GTG AAG AAG CAG AGG CAG CAG CTG GAG GAA CAG AAA 

i^n 6r LyS Val LyS LyS Gln Gln Gln Leu Glu Su Gin Lys 

1860 1865 1870 ^ 

Jin P^o m TG ll C ^ G CGC ATG GAG CCT TCA TCT CTG CCT CAG GAG 

Asn Ala Pro Met Phe Gln Arg Met Glu Pro Ser Ser Leu Pro Gln Glu 

1880 1885 1890 

lie rll a? T GCC m GCC CTG CCT TAC CTC CAG CAG GAC CCC GTT 

He He Ala Asn Ala Lys Ala Leu Pro Tyr Leu Gln Gln Asp Pro Val 
18 95 isoo 1905 

TCA GGC CTG AGT GGC CGG AGT GGA TAC CCT TCG ATG AGT CCA CTC TCT 
Ser Gly Leu Ser Gly Arg Ser Gly Tyr Pro Ser Me? Ser Pro LeS sS 
1910 1915 1920 

Sro Gln t? A IZ C ^ G TTG GCT TCT ATG ^ CCC GCC Q AT GAC GGA 

Pr ° Ile Phe Gln Le ^ Ala Cys Met Asp Pro Ala Asp Asp Gly 

AS." 1930 1935 

^ 1™ CAA GAA CGG CAG TCT CTG GTG GTG ACA GAC CCT AGC TCC ATG 
?Sn " G1U Gln Ser Leu Val Val Thr AS P p ™ Ser Ser Met 

iS4 ° 1945 1950 1955 

AGA f G * TCA TTT TCC ACT ATT CGG GAT AAG CGT TCA AAT TCC TCG TGG 
Arg Arg Ser Phe Ser Thr Ile Arg Asp Lys Arg Ser Asn Ser Ser Trp 
I960 1965 19?0 * 

U?, rlf, f^ft 11° I CC ATG GAG CGA AGC AGT GAA AAT ACC TAC AAG TCC 
Leu Glu Glu Phe Ser Met Glu Arg Ser Ser Glu Asn Thr Tyr Lys Ser 
1975 i9 80 19B5 

Itl CGG AGT ^ AC CAC TCC TCC TTG CGG CTG TCA GCC CAC CGC CTG 

Arg Arg Arg Ser Tyr His Ser Ser Leu Arg Leu Ser Ala His Arg Leu 
1990 1995 2000 

AAC TCT GAT TCA GGC CAC AAG TCT GAC ACT CAC CCC TCA GGG GGC AGG 
?«L ASP SSr Gly Hls Ser As P Thr His p ro Ser Gly Gly Arq 

2005 2010 2015 

GAG CGG CGA CGA TCA AAA GAG CGA AAG CAT CTT CTC TCT CCT GAT GTC 
G «™ 9 9 Ser Lys Glu ^9 His Leu Leu Ser Pro Asp Val 

2020 2025 2030 2035 

TCC CGC TGC AAT TCA GAA GAG CGA GGG ACC CAG GCT GAC TGG GAG TCC 
Ser Arg Cys Asn Ser Glu Glu Arg Gly Thr Gln Ala Asp Trp Glu Ser 
2040 2045 2050 

CCA GAG CGC CGT CAA TCC AGG TCA CCC AGT GAG GGC AGG TCA CAG ACG 
Pro Glu Arg Arg Gln Ser Arg Ser Pro Ser Glu Gly Arg Ser Gln Thr 
2055 2060 2065 
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CCC AAC AGA CAG GGC ACA GGT TCC CTA AGT GAG AGC TCC ATC CCC TCT 6414 
Pro Asn Arg Gin Gly Thr Gly Ser Leu Ser Glu Ser Ser He Pro Ser 
2070 2075 2080 

GTC TCT GAC ACC AGC ACC CCA AGA AGA AGT CGT CGG CAG CTC CCA CCC 6462 
Val Ser Asp Thr Ser Thr Pro Arg Arg Ser Arg Arg Gin Leu Pro Pro 
2085 2090 2095 

GTC CCG CCA AAG CCC CGG CCC CTC CTT TCC TAC AGC TCC CTG ATT CGA 6510 
Val Pro Pro Lys Pro Arg Pro Leu Leu Ser Tyr Ser Ser Leu He Arg 
2100 * 2105 2110 2115 

CAC GCG GGC AGC ATC TCT CCA CCT GCT GAT GGA AGC GAG GAG GGC TCC 6558 
His Ala Gly Ser He Ser Pro Pro Ala Asp Gly Ser Glu Glu Gly Ser 
2120 2125 2130 

CCG CTG ACC TCC CAA GCT CTG GAG AGC AAC AAT GCT TGG CTG ACC GAG 6606 
Pro Leu Thr Ser Gin Ala Leu Glu Ser Asn Asn Ala Trp Leu Thr Glu 
2135 2140 2145 

TCT TCC AAC TCT CCG CAC CCC CAG CAG AGG CAA CAT GCC TCC CCA CAG 6654 
Ser Ser Asn Ser Pro His Pro Gin Gin Arg Gin His Ala Ser Pro Gin 
23.50 2155 2160 

CGC TAC ATC TCC GAG CCC TAC TTG GCC CTG CAC GAA GAC TCC CAC GCC 6702 
Arg Tyr He Ser Glu Pro Tyr Leu Ala Leu His Glu Asp Ser His Ala 
2165 2170 2175 

TCA GAC TGT GTT GAG GAG GAG ACG CTC ACT TTC GAA GCA GCC GTG GCT 675 0 

Ser Asp Cys Val Glu Glu Glu Thr Leu Thr Phe Glu Ala Ala Val Ala 
2180 2185 2190 2195 

ACT AGC CTG GGC CGT TCC AAC ACC ATC GGC TCA GCC CCA CCC CTG CGG 6798 
Thr Ser Leu Gly Arg Ser Asn Thr He Gly Ser Ala Pro Pro Leu Arg 
2200 2205 2210 

CAT AGC TGG CAG ATG CCC AAC GGG CAC TAT CGG CGG CGG AGG CGC GGG 684 6 

His Ser Trp Gin Met Pro Asn Gly His Tyr Arg Arg Arg Arg Arg Gly 
2215 2220 2225 

GGG CCT GGG CCA GGC ATG ATG TGT GGG GCT GTC AAC AAC CTG CTA AGT 6 8 94 

Gly Pro Gly Pro Gly Met Met Cys Gly Ala Val Asn Asn Leu Leu Ser 
2230 2235 2240 

GAC ACG GAA GAA GAT GAC AAA TGC TAGAGGCTGC TCCCCCCTCC GATGCATGCT 6948 
Asp Thr Glu Glu Asp Asp Lys Cys 
2245 2250 

CTTCTCTCAC ATGGAGAAAA CCAAGACAGA ATTGGGAAGC CAGTGCGGCC CCGCGGGGAG 7008 



GAAGAGGGAA AAGGAAGATG GAAG 
(2) INFORMATION FOR SEQ ID NO: 25: 



7032 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7089 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 166.. 6978 

(D) OTHER INFORMATION: /standard_name= "Alpha -IE- 3 " 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
GCTGCTGCTG CCTCTCCGAA GAGCTCGCGG AGCTCCCCAG AGGCGGTGGT CCCCGTGCTT 
GTCTGGATGC GGCTCTGAGT CTCCGTGTGT CTTTCTGCTT GTTGCTGTGT GCGGGTGTTC 
GGCCGCGATC ACCTTTGTGT GTCTTCTGTC TGTTTAAACC TCAGG ATG GCT CGC 

Met Ala Arg 

^ S K S 32 S S S SJ S m 5 S 25 S 

s s s iss sj s s « « s « - jc. „ « 

25 30 35 

S?2 ?? C ^ AC *** CAG ACG ^ GCA AGG GCG CGG ACT ATG GCT 

Ala Ala Ala Tyr Lys Gin Thr Lys Ala Gin Arg Ala Arg £5 Me? SI 

45 50 

TTG TAC AAC CCC ATT CCC GTC CGG CAG AAC TGT TTC ACC GTC AAC AGA 
Leu Tyr Asn Pro lie Pro Val Arg Gin Asn Cys Phe vll ten irj 

55 60 6 5 

TCC CTG TTC ATC TTC GGA GAA GAT AAC ATT GTC AGG AAA TAT GCC AAG 
Ser Leu Phe lie Phe Gly Glu Asp Asn He Val Arg Lys Tyr Ala iJ2 

AAG CTC ATC GAT TGG CCG CCA TTT GAG TAC ATG ATC CTG GCC ACC ATC 
Lys Leu He Asp Trp Pro Pro Phe Glu Tyr Met He Leu Ala Thr He 
85 9 ° 95 

ill Sf £ GC A T C GTC CTG GCC CTG CAG CAT CTT CCT GAG GAT 

lie Ala Asn Cys He Val Leu Ala Leu Glu Gin His Leu Pro Glu Asp 

00 105 no ns 

GAC . f* 6 $2° CCC ATG TCC CGA AGA CTG GAG AAG ACA GAA CCT TAT TTC 
Asp Lys Thr Pro Met Ser Arg Arg Leu Glu Lys Thr Glu Pro Tyr Phe 

120 125 xlo 

ATT GGG ATC TTT TGC TTT GAA GCT GGG ATC AAA ATT GTG GCC CTG GGG 



120 
174 
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Ile Gly He Phe Cys Phe Glu Ala Gly He Lys He Val Ala Leu Gly 
135 140 145 

TTC ATC TTC CAT AAG GGC TCT TAC CTC CGC AAT GGC TGG AAT GTC ATG 654 
Phe He Phe His Lys Gly Ser Tyr Leu Arg Asn Gly Trp Asn Val Met 
150 155 160 

GAC TTC ATC GTG GTC CTC AGT GGC ATC CTG GCC ACT GCA GGA ACC CAC 702 
Asp Phe He Val Val Leu Ser Gly He Leu Ala Thr Ala Gly Thr His 
165 170 175 

TTC AAT ACT CAC GTG GAC CTG AGG ACC CTC CGG GCT GTG CGT GTC CTG 750 
Phe Asn Thr His Val Asp Leu Arg Thr Leu Arg Ala Val Arg Val Leu 
180 185 190 195 

CGG CCT TTG AAG CTC GTG TCA GGG ATA CCT AGC CTG CAG ATT GTG TTG 798 
Arg Pro Leu Lys Leu Val Ser Gly He Pro Ser Leu Gin He Val Leu 
200 205 210 

AAG TCC ATC ATG AAG GCC ATG GTA CCT CTT CTG CAG ATT GGC CTT CTG 846 
Lys Ser He Met Lys Ala Met Val Pro Leu Leu Gin He Gly Leu Leu 
215 220 225 

CTC TTC TTT GCC ATC CTG ATG TTT GCT ATC ATT GGT TTG GAG TTC TAC 894 
Leu Phe Phe Ala He Leu Met Phe Ala He He Gly Leu Glu Phe Tyr 
230 235 240 

AGT GGC AAG TTA CAT CGA GCG TGC TTC ATG AAC AAT TCA GGT ATT CTA 942 
Ser Gly Lys Leu His Arg Ala Cys Phe Met Asn Asn Ser Gly He Leu 
245 250 255 

GAA GGA TTT GAC CCC CCT CAC CCA TGT GGT GTG CAG GGC TGC CCA GCT 990 
Glu Gly Phe Asp Pro Pro His Pro Cys Gly Val Gin Gly Cys Pro Ala 
260 265 270 275 

GGT TAT GAA TGC AAG GAC TGG ATC GGC CCC AAT GAT GGG ATC ACC CAG 1038 
Gly Tyr Glu Cys Lys Asp Trp He Gly Pro Asn Asp Gly He Thr Gin 
280 285 290 

TTT GAT AAC ATC CTT TTT GCT GTG CTG ACT GTC TTC CAG TGC ATC ACC 1086 
Phe Asp Asn He Leu Phe Ala Val Leu Thr Val Phe Gin Cys He Thr 
295 300 305 

ATG GAA GGG TGG ACC ACT GTG CTG TAC AAT ACC AAT GAT GCC TTA GGA 1134 
Met Glu Gly Trp Thr Thr Val Leu Tyr Asn Thr Asn Asp Ala Leu Gly 
310 315 320 

GCC ACC TGG AAT TGG CTG TAC TTC ATC CCC CTC ATC ATC ATT GGA TCC 1182 
Ala Thr Trp Asn Trp Leu Tyr Phe He Pro Leu He He He Gly Ser 
325 330 335 

TTC TTT GTT CTC AAC CTA GTC CTG GGA GTG CTT TCC GGG GAA TTT GCC 123 0 

Phe Phe Val Leu Asn Leu Val Leu Gly Val Leu Ser Gly Glu Phe Ala 
340 345 350 355 

AAA GAG AGA GAG AGA GTG GAG AAC CGA AGG GCT TTC ATG AAG CTG CGG 1278 
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Lys Glu Arg Glu Arg Val Glu Asn Arg Arg Ala Phe Met Lys Leu Arq 
36 ° 365 370 

CGC CAG CAG CAG ATT GAG CGT GAG CTG AAT GGC TAC CGT GCC TGG ATA 
Arg Gin Gin Gin He Glu Arg Glu Leu Asn Gly Tyr Arg Ala Trp He 
375 380 385 

GAC AAA GCA GAG GAA GTC ATG CTC GCT GAA GAA AAT AAA AAT GCT GGA 
Asp Lys Ala Glu Glu Val Met Leu Ala Glu Glu Asn Lys Asn Ala Gly 
390 395 400 

ACA TCC GCC TTA GAA GTG CTT CGA AGG GCA ACC ATC AAG AGG AGC CGG 

f«f a Leu Glu Val Leu ^9 ^9 A 13 Thr He Lys Arg Ser Arg 
405 410 415 

ACA GAG GCC ATG ACT CGA GAC TCC AGT GAT GAG CAC TGT GTT GAT ATC 
Thr Glu Ala Met Thr Arg Asp Ser Ser Asp Glu His Cys Val Asp He 
420 425 430 " 435 

TCC TCT GTG GGC ACA CCT CTG GCC CGA GCC AGT ATC AAA AGT GCA AAG 
Ser Ser Val Gly Thr Pro Leu Ala Arg Ala Ser He Lys Ser Ala Lys 
440 445 450 

GTA GAC GGG GTC TCT TAT TTC CGG CAC AAG GAA AGG CTT CTG CGC ATC 
Val Asp Gly Val Ser Tyr Phe Arg His Lys Glu Arg Leu Leu Arg He 
4 55 460 465 

TCC ATT CGC CAC ATG GTT AAA TCC CAG GTG TTT TAC TGG ATT GTG CTG 
Ser He Arg His Met Val Lys Ser Gin Val Phe Tyr Trp He Val Leu 
470 475 480 

AGC CTT GTG GCA CTC AAC ACT GCC TGT GTG GCC ATT GTC CAT CAC AAC 
Ser Leu Val Ala Leu Asn Thr Ala Cys Val Ala He Val His His Asn 
485 490 495 

CAG CCC CAG TGG CTC ACC CAC CTC CTC TAC TAT GCA GAA TTT CTG TTT 
Gin Pro Gin Trp Leu Thr His Leu Leu Tyr Tyr Ala Glu Phe Leu Phe 
500 505 510 sis 

CTG GGA CTC TTC CTC TTG GAG ATG TCC CTG AAG ATG TAT GGC ATG GGG 
Leu Gly Leu Phe Leu Leu Glu Met Ser Leu Lys Met Tyr Gly Met Gly 
520 525 530 

CCT CGC CTT TAT TTT CAC TCT TCA TTC AAC TGC TTT GAT TTT GGG GTC 
Pro Arg Leu Tyr Phe His Ser Ser Phe Asn Cys Phe Asp Phe Gly Val 
535 540 545 

ACA GTG GGC AGT ATC TTT GAA GTG GTC TGG GCA ATC TTC AGA CCT GGT 
Thr Val Gly Ser He Phe Glu Val Val Trp Ala He Phe Arq Pro Glv 
550 555 560 

ACG TCT TTT GGA ATC AGT GTC TTG CGA GCC CTC CGG CTT CTA AGA ATA 
Thr Ser Phe Gly He Ser Val Leu Arg Ala Leu Arg Leu Leu Arq He 
565 570 575 

TTT AAA ATA ACC AAG TAT TGG GCT TCC CTA CGG AAT TTG GTG GTC TCC 
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Phe Lys lie Thr Lys Tyr Trp Ala Ser Leu Arg Asn Leu Val Val Ser 
580 585 590 595 

TTG ATG AGC TCA ATG AAG TCT ATC ATC AGT TTG CTT TTC CTC CTC TTC 
Leu Met Ser Ser Met Lys Ser lie He Ser Leu Leu Phe Leu Leu Phe 
• 600 605 610 

CTC TTC ATC GTT GTC TTT GCT CTC CTA GGA ATG CAG TTA TTT GGA GGC 
Leu Phe He Val Val Phe Ala Leu Leu Gly Met Gin Leu Phe Gly Gly 
615 620 625 

AGG TTT AAC TTT AAT GAT GGG ACT CCT TCG GCA AAT TTT GAT ACC TTC 
Arg Phe Asn Phe Asn Asp Gly Thr Pro Ser Ala Asn Phe Asp Thr Phe 
630 635 640 

CCT GCA GCC ATC ATG ACT GTG TTC CAG ATC CTG ACG GGT GAG GAC TGG 
Pro Ala Ala He Met Thr Val Phe Gin lie Leu Thr Gly Glu Asp Trp 
645 650 655 

AAT GAG GTG ATG TAC AAT GGG ATC CGC TCC CAG GGT GGG GTC AGC TCA 
Asn Glu Val Met Tyr Asn Gly He Arg Ser Gin Gly Gly Val Ser Ser 
660 665 670 675 

GGC ATG TGG TCT GCC ATC TAC TTC ATT GTG CTC ACC TTG TTT GGC AAC 
Glv Met Trp Ser Ala He Tyr Phe He Val Leu Thr Leu Phe Gly Asn 
680 685 690 

TAC ACG CTA CTG AAT GTG TTC TTG GCT ATC GCT GTG GAT AAT CTC GCC 
Tyr Thr Leu Leu Asn Val Phe Leu Ala He Ala Val Asp Asn Leu Ala 
695 700 705 

AAC GCC CAG GAR CTG ACC AAG GAT GAA CAG GAG GAA GAA GAG GCC TTC 
Asn Ala Gin Glu Leu Thr Lys Asp Glu Gin Glu Glu Glu Glu Ala Phe 
710 715 720 

AAC CAG AAA CAT GCA CTG CAG AAG GCC AAG GAG GTC AGC CCG ATG TCT 
Asn Gin Lys His Ala Leu Gin Lys Ala Lys Glu Val Ser Pro Met Ser 
725 730 735 

GCA CCC AAC ATG CCT TCG ATC GAA AGA GAC AGA AGG AGA AGA CAC CAC 
Ala Pro Asn Met Pro Ser He Glu Arg Asp Arg Arg Arg Arg His His 
740 745 750 755 

ATG TCG ATG TGG GAG CCA CGC AGC AGC CAC CTG AGG GAG CGG AGG CGC 
Met Ser Met Trp Glu Pro Arg Ser Ser His Leu Arg Glu Arg Arg Arg 
760 765 770 

CGG CAC CAC ATG TCC GTG TGG GAG CAG CGT ACC AGC CAG CTG AGG AAG 
Arq His His Met Ser Val Trp Glu Gin Arg Thr Ser Gin Leu Arg Lys 
775 780 785 

CAC ATG CAG ATG TCC AGC CAG GAG GCC CTC AAC AGA GAG GAG GCG CCG 
His Met Gin Met Ser Ser Gin Glu Ala Leu Asn Arg Glu Glu Ala Pro 
790 795 800 

ACC ATG AAC CCG CTC AAC CCC CTC AAC CCG CTC AGC TCC CTC AAC CCG 
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Thr Met Asn Pro Leu Asn Pro Leu Asn Pro Leu Ser Ser Leu Asn Pro 
805 8 io 815 

CTC AAT GCC CAC CCC AGC CTT TAT CGG CGA CCC AGG GCC ATT GAG GGC 
Leu Asn Ala His Pro Ser Leu Tyr Arg Arg Pro Arg Ala lie Glu Gly 
820 825 830 835 

CTG GCC CTG GGC CTG GCC CTG GAG AAG TTC GAG GAG GAG CGC ATC AGC 
Leu Ala Leu Gly Leu Ala Leu Glu Lys Phe Glu Glu Glu Arg lie Ser 
840 845 850 

CGT GGG GGG TCC CTC AAG GGG GAT GGA GGG GAC CGA TCC AGT GCC CTG 
Arg Gly Gly Ser Leu Lys Gly Asp Gly Gly Asp Arg Ser Ser Ala Leu 
855 8S0 865 

GAC AAC CAG AGG ACC CCT TTG TCC CTG GGC CAG CGG GAG CCA CCA TGG 
Asp Asn Gin Arg Thr Pro Leu Ser Leu Gly Gin Arg Glu Pro Pro Tro 
870 875 880 

CTG GCC AGG CCC TGT CAT GGA AAC TGT GAC CCG ACT CAG CAG GAG GCA 
Leu Ala Arg Pro Cys His Gly Asn Cys Asp Pro Thr Gin Gin Glu Ala 
885 890 895 

GGG GGA GGA GAG GCT GTG GTG ACC TTT GAG GAC CGG GCC AGG CAC AGG 
Gly Gly Gly Glu Ala Val Val Thr Phe Glu Asp Arg Ala Arg His Arg 
900 905 910 9i| 

CAG AGC CAA CGG CGC AGC CGG CAT CGC CGC GTC AGG ACA GAA GGC AAG 
Gin Ser Gin Arg Arg Ser Arg His Arg Arg Val Arg Thr Glu Gly Lys 
920 925 930 

GAG TCC TCT TCA GCC TCC CGG AGC AGG TCT GCC AGC CAG GAA CGC AGT 
Glu Ser Ser Ser Ala Ser Arg Ser Arg Ser Ala Ser Gin Glu Arg Ser 
935 940 945 

CTG GAT GAA GCC ATG CCC ACT GAA GGG GAG AAG GAC CAT GAG CTC AGG 
Leu Asp Glu Ala Met Pro Thr Glu Gly Glu Lys Asp His Glu Leu Arq 
950 955 960 

GGC AAC CAT GGT GCC AAG GAG CCA ACQ ATC CAA GAA GAG AGA GCC CAG 
Gly Asn His Gly Ala Lys Glu Pro Thr He Gin Glu Glu Arg Ala Gin 
965 970 975 

GAT TTA AGG AGG ACC AAC AGT CTG ATG GTG TCC AGA GGC TCC GGG CTG 
Asp Leu Arg Arg Thr Asn Ser Leu Met Val Ser Arg Gly Ser Gly Leu 
980 985 990 995 

GCA GGA GGC CTT GAT GAG GCT GAC ACC CCC CTA GTC CTG CCC CAT CCT 
Ala Gly Gly Leu Asp Glu Ala Asp Thr Pro Leu Val Leu Pro His Pro 
1000 1005 1010 

GAG CTG GAA GTG GGG AAG CAC GTG GTG CTG ACG GAG CAG GAG CCA GAA 
Glu Leu Glu Val Gly Lys His Val Val Leu Thr Glu Gin Glu Pro Glu 
1015 1020 1025 

GGC AGC AGT GAG CAG GCC CTG CTG GGG AAT GTG CAG CTA GAC ATG GGC 
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Gly Ser Ser Glu Gin Ala Leu Leu Gly Asn Val Gin Leu Asp Met Gly 
1030 1035 1040 

CGG GTC ATC AGC CAG AGC GAG CCT GAC CTC TCC TGC ATC ACG GCC AAC 
Arg Val He Ser Gin Ser Glu Pro Asp Leu Ser Cys He Thr Ala Asn 
1045 1050 1055 

ACG GAC AAG GCC ACC ACC GAG AGC ACC AGC GTC ACC GTC GCC ATC CCC 
Thr Asp Lys Ala Thr Thr Glu Ser Thr Ser Val Thr Val Ala He Pro 
1060 1065 1070 1075 

GAC GTG GAC CCC TTG GTG GAC TCA ACC GTG GTG CAC ATT AGC AAC AAG 
Asp Val Asp Pro Leu Val Asp Ser Thr Val Val His He Ser Asn Lys 
1080 1085 1090 

ACG GAT GGG GAA GCC AGT CCC TTG AAG GAG GCA GAG ATC AGA GAG GAT 
Thr Asp Gly Glu Ala Ser Pro Leu Lys Glu Ala Glu He Arg Glu Asp 
1095 1100 1105 

GAG GAG GAG GTG GAG AAG AAG AAG CAG AAG AAG GAG AAG CGT GAG ACA 
Glu Glu Glu Val Glu Lys Lys Lys Gin Lys Lys Glu Lys Arg Glu Thr 
1110 1115 H20 

GGC AAA GCC ATG GTG CCC CAC AGC TCA ATG TTC ATC TTC AGC ACC ACC 
Gly Lys Ala Met Val Pro His Ser Ser Met Phe He Phe Ser Thr Thr 
li25 1130 1135 

AAC CCG ATC CGG AGG GCC TGC CAC TAC ATC GTG AAC CTG CGC TAC TTT 
Asn Pro He Arg Arg Ala Cys His Tyr He Val Asn Leu Arg Tyr Phe 
1140 1145 1150 H55 

GAG ATG TGC ATC CTC CTG GTG ATT GCA GCC AGC AGC ATC GCC CTG GCG 
Glu Met Cys He Leu Leu Val He Ala Ala Ser Ser He Ala Leu Ala 
1160 1165 H70 

GCA GAG GAC CCC GTC CTG ACC AAC TCG GAG CGC AAC AAA GTC CTG AGG 
Ala Glu Asp Pro Val Leu Thr Asn Ser Glu Arg Asn Lys Val Leu Arg 
1175 H80 H85 

TAT TTT GAC TAT GTG TTC ACG GGC GTG TTC ACC TTT GAG ATG GTT ATA 
Tvr Phe Asp Tyr Val Phe Thr Gly Val Phe Thr Phe Glu Met Val He 
* 1190 1195 1200 

AAG ATG ATA GAC CAA GGC TTG ATC CTG CAG GAT GGG TCC TAC TTC CGA 
Lvs Met He Asp Gin Gly Leu He Leu Gin Asp Gly Ser Tyr Phe Arg 
1205 1210 1215 

GAC TTG TGG AAC ATC CTG GAC TTT GTG GTG GTC GTT GGC GCA TTG GTG 
Asp Leu Trp Asn He Leu Asp Phe Val Val Val Val Gly Ala Leu Val 
1220 1225 1230 1235 

GCC TTT GCT CTG GCG AAC GCT TTG GGA ACC AAC AAA GGA CGG GAC ATC 
Ala Phe Ala Leu Ala Asn Ala Leu Gly Thr Asn Lys Gly Arg Asp He 
1240 1245 1250 

AAG ACC ATC AAG TCT CTG CGG GTG CTC CGA GTT CTA AGG CCA CTG AAA 
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Lys Thr He Lys Ser Leu Arg Val Leu Arg Val Leu Arg Pro Leu Lys 
1255 1260 1265 

J£ iTf ^ CCC CTC **° GCC GTC TTC GAG TGC GTA GTG 

Thr He Lys Arg Leu Pro Lys Leu Lys Ala Val Phe Asp Cys Val Val 
1270 1275 1280 

ACC TCC TTG AAG AAT GTC TTC AAC ATA CTC ATT GTG TAC AAG CTC TTC 

hr ?o»c LeU LyS Asn Val Phe Asn Ile Leu val Tyr Lys Leu Phe 
1285 1290 1295 



ATG TTC ATC TTT GCT GTC ATC 
Met Phe Ile Phe Ala Val Ile 
1300 i3 0S 

TTT TAT TGC ACG GAC AGT TCC 
Phe Tyr Cys Thr Asp Ser Ser 
1320 

AAC TAT GTA GAT CAC GAG AAA 
Asn Tyr Val Asp His Glu Lys 
1335 

TGG AAG CGC CAT GAA TTC CAC 
Trp Lys Arg His Glu Phe His 
1350 



GCA GTT CAG CTC TTC AAG GGA AAG TTC 
Ala Val Gin Leu Phe Lys Gly Lys Phe 
1310 i 315 

AAG GAC ACA GAG AAG GAG TGC ATA GGC 
Lys Asp Thr Glu Lys Glu Cys Ile Gly 
1325 1330 

AAC AAG ATG GAG GTG AAG GGC CGG GAA 
Asn Lys Met Glu Val Lys Gly Arg Glu 
1340 1345 

TAC GAC AAC ATT ATC TGG GCC CTG CTG 
Tyr Asp Asn Ile lie Trp Ala Leu Leu 
1355 i3 6 o 



ACC CTC TTC ACC GTC TCC ACA GGG GAA GGA TGG CCT CAA GTT CTG CAG 
Thr Leu Phe Thr Val Ser Thr Gly Glu Gly Trp Pro S£ Sal £eu Sn 
"55 1370 1375 

CAC TCT GTA GAT GTG ACA GAG GAA GAC CGA GGC CCA AGC CGC AGC AAC 
Hxs Ser Val Asp Val Thr Glu Glu Asp Arg Gly Pro Ser Arg Ser ten 
1385 1390 1395 

CGC ATG GAG ATG TCT ATC TTT TAT GTA GTC TAC TTT GTG GTC TTC CCC 
Arg Met Glu Met Ser lie Phe Tyr Val Val Tyr Phe val val Phe Pro 
1400 1405 i4i 0 

TTC TTC TTT GTC AAT ATC TTT GTG GCT CTC ATC ATC ATC ACC TTC CAG 
Phe Phe Phe Val Asn Ile Phe Val Ala Leu lie Ile Ile Thr Phe Gin 
14X5 1420 1425 

^ ^f 3 GAT *** ATG ATG GAG GAG TGC AGC CTG GAG AAG AAT GAG 
Glu Gin Gly Asp Lys Met Met Glu Glu Cys Ser Leu Glu Lys Asn Glu 
1*30 1435 1440 

AGG GCG TGC ATC GAC TTC GCC ATC AGC GCC AAA CCT CTC ACC CGC TAC 

9 Ile Asp Phe Ala Ile ser A 1 * Lys Pro Leu Thr Arg Tyr 

1445 1450 i4 55 a * 

ATG CCG CAG AAC AGA CAC ACC TTC CAG TAC CGC GTG TGG CAC TTT GTG 
Met Pro Gin Asn Arg His Thr Phe Gin Tyr Arg Val Trp His Phe Val 
1460 1465 1470 1475 

GTG TCT CCG TCC TTT GAG TAC ACC ATT ATG GCC ATG ATC GCC TTG AAT 
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Val Ser Pro Ser Phe Glu Tyr Thr He Met Ala Met He Ala Leu Asn 
1480 1485 1490 

ACT GTT GTG CTG ATG ATG AAG TAT TAT TCT GCT CCC TGT ACC TAT GAG 4 6 86 

Thr Val Val Leu Met Met Lys Tyr Tyr Ser Ala Pro Cys Thr Tyr Glu 
1495 1500 1505 

CTG GCC CTG AAG TAC CTG AAT ATC GCC TTC ACC ATG GTG TTT TCC CTG 4734 
Leu Ala Leu Lys Tyr Leu Asn He Ala Phe Thr Met Val Phe Ser Leu 
1510 1515 1520 

GAA TGT GTC CTG AAG GTC ATC GCT TTT GGC TTT TTG AAC TAT TTC CGA 4 782 

Glu Cys Val Leu Lys Val He Ala Phe Gly Phe Leu Asn Tyr Phe Arg 
1525 1530 1535 

GAC ACC TGG AAT ATC TTT GAC TTC ATC ACC GTG ATT GGC AGT ATC ACA 4830 
Asp Thr Trp Asn He Phe Asp Phe He Thr Val He Gly Ser He Thr 
1540 1545 1550 1555 

GAA ATT ATC CTG ACA GAC AGC AAG CTG GTG AAC ACC AGT GGC TTC AAT 4878 
Glu He He Leu Thr Asp Ser Lys Leu Val Asn Thr Ser Gly Phe Asn 
1560 1565 1570 

ATG AGC TTT CTG AAG CTC TTC CGA GCT GCC CGC CTC ATA AAG CTC CTG 4926 
Met Ser Phe Leu Lys Leu Phe Arg Ala Ala Arg Leu He Lys Leu Leu 
1575 1580 1585 

CGT CAG GGC TAT ACC ATA CGC ATT TTG CTG TGG ACC TTT GTG CAG TCC 4 974 

Arg Gin Gly Tyr Thr He Arg He Leu Leu Trp Thr Phe Val Gin Ser 
1590 1595 1600 

TTT AAG GCC CTC CCT TAT GTC TGC CTT TTA ATT GCC ATG CTT TTC TTC 5022 
Phe Lys Ala Leu Pro Tyr Val Cys Leu Leu He Ala Met Leu Phe Phe 
1605 1610 1615 

ATT TAT GCC ATC ATT GGG ATG CAG GTA TTT GGA AAC ATA AAA TTA GAC 5070 
He Tyr Ala He He Gly Met Gin Val Phe Gly Asn He Lys Leu Asp 
1620 1625 1630 1635 

GAG GAG AGT CAC ATC AAC CGG CAC AAC AAC TTC CGG AGT TTC TTT GGG 5118 
Glu Glu Ser His He Asn Arg His Asn Asn Phe Arg Ser Phe Phe Gly 
1640 1645 1650 

TCC CTA ATG CTA CTC TTC AGG AGT GCC ACA GGT GAG GCC TGG CAG GAG 5166 
Ser Leu Met Leu Leu Phe Arg Ser Ala Thr Gly Glu Ala Trp Gin Glu 
1655 1660 1665 

ATT ATG CTG TCA TGC CTT GGG GAG AAG GGC TGT GAG CCT GAC ACC ACC 5214 
He Met Leu Ser Cys Leu Gly Glu Lys Gly Cys Glu Pro Asp Thr Thr 
1670 1675 1680 

GCA CCA TCA GGG CAG AAC GAG AAT GAA CGC TGC GGC ACC GAT CTG GCC 5262 
Ala Pro Ser Gly Gin Asn Glu Asn Glu Arg Cys Gly Thr Asp Leu Ala 
1685 1690 1695 

TAC GTG TAC TTT GTC TCC TTC ATC TTC TTC TGC TCC TTC TTG ATG CTC 5310 
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Tyr Val Tyr Phe Val Ser Phe He Phe Phe Cys Ser Phe Leu Met Leu 
1700 1705 1710 1715 

AAC CTG TTT GTG GCC GTC ATC ATG GAC AAC TTT GAG TAC CTG ACT CGG 
Asn Leu Phe Val Ala Val He Met Asp Asn Phe Glu Tyr Leu Thr Arq 
1720 1725 1730 

GAC TCC TCC ATC CTG GGG CCT CAC CAC TTG GAC GAG TTT GTC CGC GTC 
Asp ser ser He Leu Gly Pro His His Leu Asp Glu Phe Val Arg Val 
1735 1740 1745 

TGG GCA GAA TAT GAC CGA GCA GCA TGT GGC CGC ATC CAT TAC ACT GAG 
Trp Ala Glu Tyr Asp Arg Ala Ala Cys Gly Arg He His Tyr Thr Glu 
1750 1755 1760 

ATG TAT GAA ATG CTG ACT CTC ATG TCA CCT CCG CTA GGC CTC GGC AAG 

Met LeU Thr Leu Met Ser Pro Pro Leu Q ly Leu Gly Lys 
1765 1770 1775 

AGA TGT CCC TCC AAA GTG GCA TAT AAG AGG TTG GTC CTG ATG AAC ATG 
Arg Cys Pro Ser Lys Val Ala Tyr Lys Arg Leu Val Leu Met Asn Met 
1780 17 8 5 179Q 1795 

CCA GTA GCT GAG GAC ATG ACG GTC CAC TTC ACC TCC ACA CTT ATG GCT 
Pro Val Ala Glu Asp Met Thr Val His Phe Thr Ser Thr Leu Met Ala 
1800 1805 1810 

CTG ATC CGG ACA GCT CTG GAC ATT AAA ATT GCC AAA GGT GGT GCA GAC 
Leu He Arg Thr Ala Leu Asp He Lys He Ala Lys Gly Gly Ala Asp 
1815 1820 1825 

AGG CAG CAG CTA GAC TCA GAG CTA CAA AAG GAG ACC CTA GCC ATC TGG 
Arg Gin Gin Leu Asp Ser Glu Leu Gin Lys Glu Thr Leu Ala He Trp 
1830 1835 1840 

CCT CAC CTA TCC CAG AAG ATG CTG GAT CTG CTT GTG CCC ATG CCC AAA 
Pro His Leu Ser Gin Lys Met Leu Asp Leu Leu Val Pro Met Pro Lvs 
1845 1850 1855 

GCC TCT GAC CTG ACT GTG GGC AAA ATC TAT GCA GCA ATG ATG ATC ATG 
Ala Ser Asp Leu Thr Val Gly Lys He Tyr Ala Ala Met Met He Met 
!860 1865 1870 1875 

GAC TAC TAT AAG CAG AGT AAG GTG AAG AAG CAG AGG CAG CAG CTG GAG 
Asp Tyr Tyr Lys Gin Ser Lys Val Lys Lys Gin Arg Gin Gin Leu Glu 
1880 1885 1890 

GAA CAG AAA AAT GCC CCC ATG TTC CAG CGC ATG GAG CCT TCA TCT CTG 
Glu Gin Lys Asn Ala Pro Met Phe Gin Arg Met Glu Pro Ser Ser Leu 
1895 1900 1905 

CCT CAG GAG ATC ATT GCT AAT GCC AAA GCC CTG CCT TAC CTC CAG CAG 
Pro Gin Glu He He Ala Asn Ala Lys Ala Leu Pro Tyr Leu Gin Gin 
1910 1915 1920 

GAC CCC GTT TCA GGC CTG AGT GGC CGG AGT GGA TAC CCT TCG ATG AGT 
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CCA CTC TCT CCC CAG GAT ATA TTC CAG TTG GCT TGT ATG GAC CCC GCC 
Pro Leu Ser Pro Gin Asp He Phe Gin Leu Ala Cys Met Asp Pro Ala 
1940 1945 1950 1955 

GAT GAC GGA CAG TTC CAA GAA CGG CAG TCT CTG GTG GTG ACA GAC CCT 
Asp Asp Gly Gin Phe Gin Glu Arg Gin Ser Leu Val Val Thr Asp Pro 
1960 1965 1970 

AGC TCC ATG AGA CGT TCA TTT TCC ACT ATT CGG GAT AAG CGT TCA AAT 
Ser Ser Met Arg Arg Ser Phe Ser Thr He Arg Asp Lys Arg Ser Asn 
1975 1980 1965 

TCC TCG TGG TTG GAG GAA TTC TCC ATG GAG CGA AGC AGT GAA AAT ACC 
Ser Ser Trp Leu Glu Glu Phe Ser Met Glu Arg Ser Ser Glu Asn Thr 
1990 1995 2000 

TAC AAG TCC CGT CGC CGG AGT TAC CAC TCC TCC TTG CGG CTG TCA GCC 
Tyr Lys Ser Arg Arg Arg Ser Tyr His Ser Ser Leu Arg Leu Ser Ala 
2005 2010 2015 

CAC CGC CTG AAC TCT GAT TCA GGC CAC AAG TCT GAC ACT CAC CCC TCA 
His Arg Leu Asn Ser Asp Ser Gly His Lys Ser Asp Thr His Pro Ser 
2020 2025 2030 2035 

GGG GGC AGG GAG CGG CGA CGA TCA AAA GAG CGA AAG CAT CTT CTC TCT 
Gly Gly Arg Glu Arg Arg Arg Ser Lys Glu Arg Lys His Leu Leu Ser 
2040 2045 2050 

CCT GAT GTC TCC CGC TGC AAT TCA GAA GAG CGA GGG ACC CAG GCT GAC 
Pro Asp Val Ser Arg Cys Asn Ser Glu Glu Arg Gly Thr Gin Ala Asp 
2055 2060 2065 

TGG GAG TCC CCA GAG CGC CGT CAA TCC AGG TCA CCC AGT GAG GGC AGG 
Trp Glu Ser Pro Glu Arg Arg Gin Ser Arg Ser Pro Ser Glu Gly Arg 
2070 2075 2080 

TCA CAG ACG CCC AAC AGA CAG GGC ACA GGT TCC CTA AGT GAG AGC TCC 
Ser Gin Thr Pro Asn Arg Gin Gly Thr Gly Ser Leu Ser Glu Ser Ser 
2085 2090 2095 

ATC CCC TCT GTC TCT GAC ACC AGC ACC CCA AGA AGA AGT CGT CGG CAG 
He Pro Ser Val Ser Asp Thr Ser Thr Pro Arg Arg Ser Arg Arg Gin 
2100 2105 2110 2115 

CTC CCA CCC GTC CCG CCA AAG CCC CGG CCC CTC CTT TCC TAC AGC TCC 
Leu Pro Pro Val Pro Pro Lys Pro Arg Pro Leu Leu Ser Tyr Ser Ser 
2120 2125 2130 

CTG ATT CGA CAC GCG GGC AGC ATC TCT CCA CCT GCT GAT GGA AGC GAG 
Leu He Arg His Ala Gly Ser He Ser Pro Pro Ala Asp Gly Ser Glu 
2135 2140 2145 

GAG GGC TCC CCG CTG ACC TCC CAA GCT CTG GAG AGC AAC AAT GCT TGG 
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Glu Gly Ser Pro Leu Thr Ser Gin Ala Leu Glu Ser Asn Asn Ala Trp 
2155 2160 

%1» cf T c CC ^ C TCT CCG °^ CCC O" 3 CAG AGG CAA CAT GCC 
Leu Throiu ser Ser Asn Ser Pro His Pro Gin Gin Arg Gin His Ala 
2170 2175 

Ser Srn ^ C A T C TCC GAG CCC TAC ™ ^CC CTG CAC GAA GAC 

2^80 **» HU SeZ G1U Pr ° *** Leu Ala Leu His gIS 2p 

2185 2190 2195 

Ser Ss S IS S A ° 2* GT ? GAG GAG GAG ACG CTC ACT ™ GAA GCA 
Ser His Ala Ser Asp Cys Val Glu Glu Glu Thr Leu Thr Phe Glu Ala 

2200 2205 2210 

Alf vl? ACT AGC CTG GGC CGT TCC A CC ATC GGC TCA GCC CCA 

Ala Val Ala Thr Ser Leu Gly Arg Ser Asn Thr He Gly Ser 2£ £J 

2215 2220 2225 

CCC CTG CGG CAT AGC TGG CAG ATG CCC AAC GGG CAC TAT CGG CGG CGG 
Pro Leu Ar^His Ser Trp Gin Me^Pro Asn Gly His gjjg Arg Ar? 

it- 2 GG CCT GGG CCA 6(30 ATG ATG TGT GGG GCT GTC AAC AAC 

Arg Arg Gly Gly Pro Gly Pro Gly Met Met Cys Gly Ala 55 A^n 
2250 2255 

S 2 £ g SJ SJ £ « ~ — «« TCCCCCCTCC 

2260 2265 ' 2 270 

GATGCATGCT CTTCTCTCAC ATGGAGAAAA CCAAGACAGA ATTGGGAAGC CAGTGCGGCC 
CCG CGGGGAG GAAGAGGGAA AAGGAAGATG GAAG 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2634 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



7055 
7089 



FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1983 

(D) OTHER INFORMATION: /standard_name= 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 

mI G vT AGG GAC ATG TCC TCT CCT CCC ACA CCG GCG GCG GCG 

Met Val Gin Arg Asp Met Ser Lys Ser Pro Pro Thr Pro Ala Ala Ala 
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GTG GCG CAG GAG ATC CAG ATG GAA CTG CTA GAG AAC GTG GCT CCC GCG 
Val Ala Gin Glu lie Gin Met Glu Leu Leu Glu Asn Val Ala Pro Ala 
20 25 30 

GGG GCG CTC GGA GCC GCC GCA CAG TCA TAT GGA AAA GGA GCC AGA AGG 
Gly Ala Leu Gly Ala Ala Ala Gin Ser Tyr Gly Lys Gly Ala Arg Arg 
35 40 45 

AAA AAC AGA TTT AAA GGA TCT GAT GGA AGC ACG TCA TCT GAT ACT ACC 
Lys Asn Arg Phe Lys Gly Ser Asp Gly Ser Thr Ser Ser Asp Thr Thr 
50 55 60 

TCA AAT AGT TTT GTT CGC CAG GGT TCG GCA GAC TCC TAC ACT AGC CGT 
Ser Asn Ser Phe Val Arg Gin Gly Ser Ala Asp Ser Tyr Thr Ser Arg 
65 70 75 80 

CCA TCC GAT TCC GAT GTA TCT CTG GAG GAG GAC CGG GAG GCA GTG CGC 
Pro Ser Asp Ser Asp Val Ser Leu Glu Glu Asp Arg Glu Ala Val Arg 
85 90 95 

AGA GAA GCG GAG CGG CAG GCC CAG GCA CAG TTG GAA AAA GCA AAG ACA 
Arg Glu Ala Glu Arg Gin Ala Gin Ala Gin Leu Glu Lys Ala Lys Thr 
100 105 110 

AAG CCC GTT GCA TTT GCG GTT CGG ACA AAT GTC AGC TAC AGT GCG GCC 
Lys Pro Val Ala Phe Ala Val Arg Thr Asn Val Ser Tyr Ser Ala Ala 
115 120 125 

CAT GAA GAT GAT GTT CCA GTG CCT GGC ATG GCC ATC TCA TTC GAA GCA 
His Glu Asp Asp Val Pro Val Pro Gly Met Ala He Ser Phe Glu Ala 
130 135 140 

AAA GAT TTT CTG CAT GTT AAG GAA AAA TTT AAC AAT GAC TGG TGG ATA 
Lys Asp Phe Leu His Val Lys Glu Lys Phe Asn Asn Asp Trp Trp He 
145 150 155 160 

GGG CGA TTG GTA AAA GAA GGC TGT GAA ATC GGA TTC ATT CCA AGC CCA 
Gly Arg Leu Val Lys Glu Gly Cys Glu He Gly Phe He Pro Ser Pro 
165 170 175 

GTC AAA CTA GAA AAC ATG AGG CTG CAG CAT GAA CAG AGA GCC AAG CAA 
Val Lys Leu Glu Asn Met Arg Leu Gin His Glu Gin Arg Ala Lys Gin 
180 185 190 

GGG AAA TTC TAC TCC AGT AAA TCA GGA GGA AAT TCA TCA TCC AGT TTG 
Gly Lys Phe Tyr Ser Ser Lys Ser Gly Gly Asn Ser Ser Ser Ser Leu 
195 200 205 

GGT GAC ATA GTA CCT AGT TCC AGA AAA TCA ACA CCT CCA TCA TCT GCT 
Gly Asp He Val Pro Ser Ser Arg Lys Ser Thr Pro Pro Ser Ser Ala 
210 215 220 

ATA GAC ATA GAT GCT ACT GGC TTA GAT GCA GAA GAA AAT GAT ATT CCA 
He Asp He Asp Ala Thr Gly Leu Asp Ala Glu Glu Asn Asp He Pro 
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GCA AAC CAC CGC TCC CCT AAA CCC AGT GCA AAC AGT GTA ACG TCA CCC 
Ala Asn His Arg Ser Pro Lys Pro Ser Ala Asn Ser VaY ?£r SeJ Pro 
245 250 255 

l~ ni G *** AGA ATG CCC TTC m AAG AAG ACA GAG CAC ACT 

His Ser Lys Glu Lys Arg Met Pro Phe Phe Lys Lys Thr Glu His Thr 

260 265 270 

CCT CCG TAT GAT GTG GTA CCT TCC ATG CGA CCA GTG GTC CTA GTG CCC 
Pro Pro Tyr Asp Val Val Pro Ser Met Arg Pro S3 55 2 S2 S?5 



285 

Pro III fl?, GGC I AC GAG GTC ACA ^ ATG A TG CAA AAA GCG CTG 

Pro Ser Leu Lys Gly Tyr Glu Val Thr Asp Met Met Gin Lys Ala Leu 
295 300 

HI GAT TP TTA AAA CAC AGA TTT GAA GGG CGG ATA TCC ATC ACA AGG 
Phe Asp Phe Leu Lys His Arg Phe Glu Gly Arg He sir ill Arg 
310 315 320 

GTC ACC GCT GAC ATC TCG CTT GCC AAA CGC TCG GTA TTA AAC AAT CCC 
Val Thr Ala Asp lie Ser Leu Ala Lys Arg Ser Val ™ JJn So 

325 330 335 

AGT AAG CAC GCA ATA ATA GAA AGA TCC AAC ACA AGG TCA AGC TTA GCG 
Ser Lys Hxs Ala He He Glu Arg Ser Asn Thr Arg Ser Ser Leu* 22 
340 345 3so 

GAA GTT CAG AGT GAA ATC GAA AGG ATT TTT GAA CTT GCA AGA ACA TTG 
Glu Val Gin Ser Glu He Glu Arg He Phe Glu Leu A ia £rg JS llu 
■»« 360 365 

2S Li. vJ? v TC t CTT GAC G ? G GAT ACA ATT C CA GCT CAA CTC 

Gin Leu Val Val Leu Asp Ala Asp Thr He Asn His Pro Ala Gin Leu 
370 375 3 8 o 

AG Z ?** tZ C Z CC 7*° GCC CCT ATT ATA GTA TAT GTA AAG ATT TCT TCT 
Ser Lys Thr Ser Leu Ala Pro He He Val Tyr Val Lys He Ser Ser 
385 390 395 400 

Pr« ^ T A AGG TTA ATA AAA TCT CGA GGG AAA TCT CAA GCT 

Pro Lys Val Leu Gin Arg Leu He Lys Ser Arg Gly Lys Ser Gin Ala 
405 410 415 

AAA CAC CTC AAC GTC CAG ATG GTA GCA GCT GAT AAA CTG GCT CAG TGT 
Lys His Leu Asn Val Gin Met Val Ala Ala Asp Lys Leu Ala Gin Cys 
4 20 425 * 430 

CCT CCA GAG CTG TTC GAT GTG ATC TTG GAT GAG AAC CAG CTT GAG GAT 
Pro Pro Glu Leu Phe Asp Val He Leu Asp Glu Asn Gin Leu Glu Asp 
435 440 445 

GCC TGT GAG CAC CTT GCC GAC TAT CTG GAG GCC TAC TGG AAG GCC ACC 
Ala Cys Glu His Leu Ala Asp Tyr Leu Glu Ala Tyr Trp Lys Ala Thr 
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CAT CCT CCC AGC AGT AGC CTC CCC AAC CCT CTC CTT AGC CGT ACA TTA 
His Pro Pro Ser Ser Ser Leu Pro Asn Pro Leu Leu Ser Arg Thr Leu 
465 470 475 480 

GCC ACT TCA AGT CTG CCT CTT AGC CCC ACC CTA GCC TCT AAT TCA CAG 
Ala Thr Ser Ser Leu Pro Leu Ser Pro Thr Leu Ala Ser Asn Ser Gin 
485 490 495 

GGT TCT CAA GGT GAT CAG AGG ACT GAT CGC TCC GCT CCT ATC CGT TCT 
Gly Ser Gin Gly Asp Gin Arg Thr Asp Arg Ser Ala Pro He Arg Ser 
500 505 510 

GCT TCC CAA GCT GAA GAA GAA CCT AGT GTG GAA CCA GTC AAG AAA TCC 
Ala Ser Gin Ala Glu Glu Glu Pro Ser Val Glu Pro Val Lys Lys Ser 
515 520 525 

CAG CAC CGC TCT TCC TCC TCA GCC CCA CAC CAC AAC CAT CGC AGT GGG 
Gin His Arg Ser Ser Ser Ser Ala Pro His His Asn His Arg Ser Gly 
530 535 540 

ACA AGT CGC GGC CTC TCC AGG CAA GAG ACA TTT GAC TCG GAA ACC CAG 
Thr Ser Arg Gly Leu Ser Arg Gin Glu Thr Phe Asp Ser Glu Thr Gin 
545 550 555 560 

GAG AGT CGA GAC TCT GCC TAC GTA GAG CCA AAG GAA GAT TAT TCC CAT 
Glu Ser Arg Asp Ser Ala Tyr Val Glu Pro Lys Glu Asp Tyr Ser His 
565 570 575 

GAC CAC GTG GAC CAC TAT GCC TCA CAC CGT GAC CAC AAC CAC AGA GAC 
Asp His Val Asp His Tyr Ala Ser His Arg Asp His Asn His Arg Asp 
580 585 590 

GAG ACC CAC GGG AGC AGT GAC CAC AGA CAC AGG GAG TCC CGG CAC CGT 
Glu Thr His Gly Ser Ser Asp His Arg His Arg Glu Ser Arg His Arg 
595 600 605 

TCC CGG GAC GTG GAT CGA GAG CAG GAC CAC AAC GAG TGC AAC AAG CAG 
Ser Arg Asp Val Asp Arg Glu Gin Asp His Asn Glu Cys Asn Lys Gin 
610 615 620 

CGC AGC CGT CAT AAA TCC AAG GAT CGC TAC TGT GAA AAG GAT GGA GAA 
Arq Ser Arg His Lys Ser Lys Asp Arg Tyr Cys Glu Lys Asp Gly Glu 
625 630 635 640 

GTG ATA TCA AAA AAA CGG AAT GAG GCT GGG GAG TGG AAC AGG GAT GTT 
Val He Ser Lys Lys Arg Asn Glu Ala Gly Glu Trp Asn Arg Asp Val 
645 650 655 

TAC ATC CCC CAR TGAGTTTTGC CCTTTTGTGT ttttttTTTT TTTTTTTTGA 
Tyr He Pro Gin 
660 

AGTCTTGTAT AACTAACAGC ATCCCCAAAA CAAAAAGTCT TTGGGGTCTA CACTGCAATC 
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CAATAGCATG 
TGGTATGGCT 
GCCACTTGAA 
ATGTATCCAA 
TTTTAATACG 
TGCTTGTACC 
GATCATAAAG 
TAAATGTTAT 
GTTTGATCAC 



ATATGTGATC TGTCTTGTAA TATTTTGTAT TATTGCTGTT GCTTGAATAG 
GATAGAGTAT TGAGATACTT TTTCTTTTGT AAGTGCTACA TAAATTGGCC 
GCAGTCCTCC GGTTGCATAC TGGACTCTTC AAAAACTGTT TTGGGTAGCT 
CAAAATCTGT TGCCACCCAG GTGATGTTAG TGTTTTAAGA AATGTAGTTG 
CAAGCCAGAA TCAGCACAGA TAAAAAGTGG AATTTCTTGT TTCTCCAGAT 
TTAATACGCA GGCATCTGAT TTGCATATTC ATTCATGGAC CACTGTTTCT 
TCTGGCTGAC TAAATTTGGG GACAGATTCA GTCTTGCCTT ACACAAAGGG 
TTAGAATCTA TTTTCTATGT ACTAGTACTG TGTACTGTAT AGACAGTTTG 
TTCTGCAAAC AAACACCTCC TTATTATATA TAATATATAT ATATATATCA 
ACTATTTTAG AGTC 
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 1823 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 69.. 1631 

(D) OTHER INFORMATION: /standard_name« »Beta-4" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
AGCCCAGCCT CGGGGGCCAG CCCCCTCCGC CCACCGCACA CGGGCTGGCC ATGCGGCGGC 
TCTGAACG ATG TCC TCC TCC TCC TAC GCC AAG AAC GGG ACC GCG GAC GGG 
Met Ser Ser Ser Ser Tyr Ala Lys Asn Gly Thr Ala Asp Gly 

CCG CAC TCC CCC ACC TCG CAG GTG GCC CGA GGC ACC ACA ACC CGG AGG 
Pro His Ser Pro Thr Ser Gin Val Ala Arg Gly Thr Thr Thr Arg Arg 
15 20 25 30 

sir Tlf, ^ A I° C GAT GGC AGC ACC ACT TCG ACC A< 3C TTC ATC 

Ser Arg Leu Lys Arg Ser Asp Gly Ser Thr Thr Ser Thr Ser Phe He 
35 40 45 

AGA S AG G ? T TCA GCG GAT TCC TAC ACA AG ^ AGG CCG TCT GAC TCC 
Leu Arg Gin Gly Ser Ala Asp Ser Tyr Thr Ser Arg Pro Ser Asp Ser 



2140 
2200 
2260 
2320 
2380 
2440 
2500 
2560 
2620 
2634 
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GAT GTC TCT TTG GAA GAG GAC CGG GAA GCA ATT CGA CAG GAG AGA GAA 
Asp Val Ser Leu Glu Glu Asp Arg Glu Ala He Arg Gin Glu Arg Glu 
55 70 75 

CAG CAA GCA GCT ATC CAG CTT GAG AGA GCA AAG TCC AAA CCT GTA GCA 
Gin Gin Ala Ala He Gin Leu Glu Arg Ala Lys Ser Lys Pro Val Ala 

80 85 90 

TTT GCC GTG AAG ACA AAT GTG AGC TAC TGC GGC GCC CTG GAC GAG GAT 
Phe Ala Val Lys Thr Asn Val Ser Tyr Cys Gly Ala Leu Asp Glu Asp 
95 * 100 105 HO 

GTG CCT GTT CCA AGC ACA GCT ATC TCC TTT GAT GCT AAA GAC TTT CTA 
Val Pro Val Pro Ser Thr Ala He Ser Phe Asp Ala Lys Asp Phe Leu 
US 120 I 25 

CAT ATT AAA GAG AAA TAT AAC AAT GAT TGG TGG ATA GGA AGG CTG GTG 
His He Lys Glu Lys Tyr Asn Asn Asp Trp Trp He Gly Arg Leu Val 
130 135 140 

AAA GAG GGC TGT GAA ATT GGC TTC ATT CCA AGT CCA CTC AGA TTG GAG 
Lys Glu Gly Cys Glu He Gly Phe He Pro Ser Pro Leu Arg Leu Glu 
145 ISO 155 

AAC ATA CGG ATC CAG CAA GAA CAA AAA AGA GGA CGT TTT CAC GGA GGG 
Asn He Arg He Gin Gin Glu Gin Lys Arg Gly Arg Phe His Gly Gly 
160 165 170 

AAA TCA AGT GGA AAT TCT TCT TCA AGT CTT GGA GAA ATG GTA TCT GGG 
Lys Ser Ser Gly Asn Ser Ser Ser Ser Leu Gly Glu Met Val Ser Gly 
175 180 185 

ACA TTC CGA GCA ACT CCC ACA TCA ACA GCA AAA CAG AAG CAA AAA GTG 
Thr Phe Arg Ala Thr Pro Thr Ser Thr Ala Lys Gin Lys Gin Lys Val 
195 200 

ACG GAG CAC ATT CCT CCT TAC GAT GTT GTA CCG TCA ATG CGT CCG GTG 
T^r G?u Ss iVe Pro Pro Tyr Asp val Val Pro Ser Met Arg Pro Val 
210 215 220 

GTG TTA btu ^ — — _ GGT TAC GAG GTA ACA GAC . 

Val Leu Val Gly Pro Ser Leu Lys Gly Tyr Glu Val Thr Asp Met Met 

225 230 
CAG AAA GCC CTC TTT GAT TCC CTG AAG CAC AGG TTT GAT GGG AGG ATT 
Sn Lys Ala Leu Phe Asp Ser Leu Lys His Arg Phe Asp Gly Arg He 
240 245 250 

S S S 5 SS S S SS 5 5 i S K 55 IS S 

255 260 265 

S5 52 5£ S 55 SSSS S S5 S S SS S S 

275 280 
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= = = a a a s a s s a s s ss 2 

295 300 

S 2 2 2 Si 2 2 2 S 2 S 2 S S SS £ 

310 315 

S S Si 2 2 2 s 2 2 s 2 s s s as s 

325 330 

i SS 2 2 2 ss s a SS s S S 2 5 a 2 



350 



"-22E2SS2SS22222222 

360 365 

S S Si 2 2 2 Si K 2 Asp 2 S S 5 Si SS 

375 380 

*2 2 ss 2 s Si s; ss Si s; ss s 2 s 2 

390 395 



5 a s s as s s a a g 2 Met 2 s 2 2 

405 410 



2 2 SS 2 2 5 2 S 2 2 2 2 2 2 S S 

12 is 2 ss 2 ss a Se? a as 5 ss as 2 a ss 

J5 440 445 

SS 12 S S Si S SS 2 2 S 2 2 2 Si SS £ 



460 



as ss ssss a 2 a ss a ss 2 2 2 2 
*ss as 2 2 a as ss 2 2 2 si ss 2 2 2 2 

2 2 G 2 2 2 2 2 2 SS 2 SS 2 2 2 2 2 

505 510 
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GGA TAT AGC CAT GAC TCC CGA CAT AGG CTT TGAGTCTAAT GAAACAAAAA 1648 
Gly Tyr Ser His Asp Ser Arg His Arg Leu 
515 520 

ATATTCATCT GTTGACAATT TGCCATAGCA GTGCTAGGAT AAACCAATCA TCTTAACTTG 1708 

GCTAACATAG CACAGTATTT ACTGTGCTAA TGGGCTGCTG TCATTTTATG CTAAGTAAGG 1768 

GGCAAAAAAA AAAATTACAT TATGCCCTTG AGTCTAGATG GATATTAGAT GCCCG 1823 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 520 amino acids 

(B) TYPE : amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Met Ser Ser Ser Ser Tyr Ala Lys Asn Gly Thr Ala Asp Gly Pro His 
1 ,5 10 15 

Ser Pro Thr Ser Gin Val Ala Arg Gly Thr Thr Thr Arg Arg Ser Arg 
20 25 30 

Leu Lys Arg Ser Asp Gly Ser Thr Thr Ser Thr Ser Phe lie Leu Arg 
35 40 45 

Gin Gly Ser Ala Asp Ser Tyr Thr Ser Arg Pro Ser Asp Ser Asp Val 
50 55 60 

Ser Leu Glu Glu Asp Arg Glu Ala He Arg Gin Glu Arg Glu Gin Gin 
65 70 75 80 

Ala Ala He Gin Leu Glu Arg Ala Lys Ser Lys Pro Val Ala Phe Ala 
85 90 95 

Val Lys Thr Asn Val Ser Tyr Cys Gly Ala Leu Asp Glu Asp Val Pro 
100 105 HO 

Val Pro Ser Thr Ala He Ser Phe Asp Ala Lys Asp Phe Leu His He 
115 120 125 

Lys Glu Lys Tyr Asn Asn Asp Trp Trp He Gly Arg Leu Val Lys Glu 
130 135 140 



Arg He Gin Gin Glu Gin Lys Arg Gly Arg Phe His Gly Gly Lys Ser 
165 170 175 

Ser Gly Asn Ser Ser Ser Ser Leu Gly Glu Met Val Ser Gly Thr Phe 
180 185 190 
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Arg Ala Thr Pro Thr Ser Thr Ala Lys Gin Lys Gin Lys Val Thr ( 

195 200 205 

His He Pro Pro Tyr Asp Val Val Pro ' 
210 215 

Val Gly Pro Ser Leu Lys Gly Tyr Glu Val Thr Asp Met Met Gin Lys 
230 235 2 i 0 

Ala Leu Phe Asp Ser Leu Lys His Arg Phe Asp Gly Arg lie ser He 

245 250 255 

Thr Arg Val Thr Ala Asp He Ser Leu Ala Lys Arg Ser Val Leu Asn 

265 270 
Asn Pro Ser Lys Arg Ala He He Glu Arg Ser Asn Thr Arg Ser Ser 
280 285 

Leu Ala Glu Val Gin Ser Glu He Glu Arg He Phe Glu Leu Ala Arg 
295 300 

Ser Leu Gin Leu Val Val Leu Asp Ala Asp Thr He Asn His Pro Ala 
310 315 320 



Gin Leu He Lys Thr Ser Leu Ala Pro He He Val His Val Lys 
330 --- 



Val 



Ser Ser Pro Lys Val Leu Gin Arg Leu He Lys Ser Arg Gly Lys Ser 



350 



Gin Ser Lys His Leu Asn Val Gin Leu Val Ala Ala Asp Lys Leu Ala 
J " 360 36S 

Gin Cys Pro Pro Glu Met Phe Asp Val He Leu Asp Glu Asn Gin Leu 

J70 375 380 

Glu Asp Ala Cys Glu His Leu Gly Glu Tyr Leu Glu Ala Tyr Trp Arg 

Ala Thr His Thr Thr Ser Ser Thr Pro Met Thr Pro Leu Leu Gly Ara 

405 410 415 

Asn Leu Gly Ser Thr Ala Leu Ser Pro Tyr Pro Thr Ala He Ser Gly 
420 425 430 

Leu Gin Ser Gin Arg Met Arg His Ser Asn His Ser Thr Glu Asn Ser 
4 35 440 445 

Pr ° lit G1U ** 9 ^ Ser Leu Met Thr Ser Glu Asn Tyr His Asn 

450 455 460 

Glu Arg Ala Arg Lys Ser Arg Asn Arg Leu Ser Ser Ser Ser Gin His 
465 470 475 480 

Ser Arg Asp His Tyr Pro Leu Val Glu Glu Asp Tyr Pro Asp Ser Tyr 
465 490 495 
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Gin Asp Thr Tyr Lys Pro His Arg Asn Arg Gly Ser Pro Gly Gly Tyr 
500 505 510 

Ser His Asp Ser Arg His Arg Leu 
515 520 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3636 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 35.. 3346 

(D) OTHER INFORMATION: /standard_name= "Alpha -2a" 

(ix) FEATURE: 

(A) NAME/KEY: 5 ' UTR 

(B) LOCATION: 1..34 

(ix) FEATURE: 

(A) NAME/KEY: 3 'UTR 

(B) LOCATION: 3347.. 3636 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 

GCGGGGGAGG GGGCATTGAT CTTCGATCGC GAAG ATG GCT GCT GGC TGC CTG 

Met Ala Ala Gly Cys Leu 
1 5 

CTG GCC TTG ACT CTG ACA CTT TTC CAA TCT TTG CTC ATC GGC CCC TCG 
Leu Ala Leu Thr Leu Thr Leu Phe Gin Ser Leu Leu lie Gly Pro Ser 
10 15 20 

TCG GAG GAG CCG TTC CCT TCG GCC GTC ACT ATC AAA TCA TGG GTG GAT 
Ser Glu Glu Pro Phe Pro Ser Ala Val Thr lie Lys Ser Trp Val Asp 
25 30 35 

AAG ATG CAA GAA GAC CTT GTC ACA CTG GCA AAA ACA GCA AGT GGA GTC 
Lys Met Gin Glu Asp Leu Val Thr Leu Ala Lys Thr Ala Ser Gly Val 
40 45 50 

AAT CAG CTT GTT GAT ATT TAT GAG AAA TAT CAA GAT TTG TAT ACT GTG 
Asn Gin Leu Val Asp He Tyr Glu Lys Tyr Gin Asp Leu Tyr Thr Val 
55 60 65 70 

GAA CCA AAT AAT GCA CGC CAG CTG GTA GAA ATT GCA GCC AGG GAT ATT 
Glu Pro Asn Asn Ala Arg Gin Leu Val Glu He Ala Ala Arg Asp He 



WO 95/04822 



PCT/US94/09230 



-232- 



G?S Zll t GC ** C £ GA TCT m GCC CTG GTG AGC CTG GCA TTG 

Glu Lys Leu Leu Ser Asn Arg Ser Lys Ala Leu Val Ser Leu Ala Leu 

90 95 ioo 

GAA GCG GAG AAA GTT CAA GCA GCT CAC CAG TGG AGA GAA GAT TTT GCA 
Glu Ala Glu Lys Val Gin Ala Ala His Gin Trp Arg Glu Asp Phe Ala 
105 110 us 

AGC AAT GAA GTT GTC TAC TAC AAT GCA AAG GAT GAT CTC GAT CCT GAG 
Ser Asn Glu Val Val Tyr Tyr Asn Ala Lys Asp Asp Leu Asp Pro Glu 
120 125 130 

AAA AAT GAC AGT GAG CCA GGC AGC CAG AGG ATA AAA CCT GTT TTC ATT 
Lys Asn Asp Ser Glu Pro Gly Ser Gin Arg lie Lys Pro Val Phe lie 
135 "0 145 150 

GCT ^ T ^ CGA ATA TCT TAT GCA GCA GTC 

Glu Asp Ala Asn Phe Gly Arg Gin lie Ser Tyr Gin His Ala Ala Val 
155 160 165 

SI tTZ vrl f C r? C TAT GAG GQC TCA ACA ATT GTG TTA AAT GAA 

Hie He Pro Thr Asp He Tyr Glu Gly Ser Thr He Val Leu Asn Glu 
17 ° 175 180 

CTC AAC TGG ACA AGT GCC TTA GAT GAA GTT TTC AAA AAG AAT CGC GAG 
Leu Asn Trp Thr Ser Ala Leu Asp Glu Val Phe Lys Lys Asn Arg Glu 
185 190 2.9S 

GAA GAC CCT TCA TTA TTG TGG CAG GTT TTT GGC AGT GCC ACT GGC CTA 
Glu Asp Pro Ser Leu Leu Trp Gin Val Phe Gly Ser Ala Thr Gly Leu 
200 205 210 

GCT CGA TAT TAT CCA GCT TCA CCA TGG GTT GAT AAT AGT AGA ACT CCA 
Ala Arg Tyr Tyr Pro Ala Ser Pro Trp Val Asp Asn Ser Arg Thr Pro 
215 220 225 230 

AAT AAG ATT GAC CTT TAT GAT GTA CGC AGA AGA CCA TGG TAC ATC CAA 
Asn Lys He Asp Leu Tyr Asp Val Arg Arg Arg Pro Trp Tyr He Gin 
235 240 245 

GGA GCT GCA TCT CCT AAA GAC ATG CTT ATT CTG GTG GAT GTG AGT GGA 
Gly Ala Ala Ser Pro Lys Asp Met Leu He Leu Val Asp Val Ser Gly 
250 255 260 

AGT GTT AGT GGA TTG ACA CTT AAA CTG ATC CGA ACA TCT GTC TCC GAA 
Ser Val Ser Gly Leu Thr Leu Lys Leu He Arg Thr Ser Val Ser Glu 
265 270 275 

ATG TTA GAA ACC CTC TCA GAT GAT GAT TTC GTG AAT GTA GCT TCA TTT 
Met Leu Glu Thr Leu Ser Asp Asp Asp Phe Val Asn Val Ala Ser Phe 
280 285 290 

AAC AGC AAT GCT CAG GAT GTA AGC TGT TTT CAG CAC CTT GTC CAA GCA 
Asn Ser Asn Ala Gin Asp Val Ser Cys Phe Gin His Leu Val Gin Ala 
295 300 305 310 
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AAT GTA AGA AAT AAA AAA GTG TTG AAA GAC GCG GTG AAT AAT ATC ACA 
Asn Val Arg Asn Lys Lys Val Leu Lys Asp Ala Val Asn Asn He Thr 
315 320 325 

GCC AAA GGA ATT ACA GAT TAT AAG AAG GGC TTT AGT TTT GCT TTT GAA 
Ala Lys Gly He Thr Asp Tyr Lys Lys Gly Phe Ser Phe Ala Phe Glu 
330 335 340 

CAG CTG CTT AAT TAT AAT GTT TCC AGA GCA AAC TGC AAT AAG ATT ATT 
Gin Leu Leu Asn Tyr Asn Val Ser Arg Ala Asn Cys Asn Lys He He 
345 350 355 

ATG CTA TTC ACG GAT GGA GGA GAA GAG AGA GCC CAG GAG ATA TTT AAC 
Met Leu Phe Thr Asp Gly Gly Glu Glu Arg Ala Gin Glu He Phe Asn 
360 365 370 

AAA TAC AAT AAA GAT AAA AAA GTA CGT GTA TTC AGG TTT TCA GTT GGT 
Lys Tyr Asn Lys Asp Lys Lys Val Arg Val Phe Arg Phe Ser Val Gly 
375 380 385 390 

CAA CAC AAT TAT GAG AGA GGA CCT ATT CAG TGG ATG GCC TGT GAA AAC 
Gin His Asn Tyr Glu Arg Gly Pro He Gin Trp Met Ala Cys Glu Asn 
395 400 405 

AAA GGT TAT TAT TAT GAA ATT CCT TCC ATT GGT GCA ATA AGA ATC AAT 
Lys Gly Tyr Tyr Tyr Glu lie Pro Ser He Gly Ala He Arg He Asn 
410 415 420 

ACT CAG GAA TAT TTG GAT GTT TTG GGA AGA CCA ATG GTT TTA GCA GGA 
Thr Gin Glu Tyr Leu Asp Val Leu Gly Arg Pro Met Val Leu Ala Gly 
425 430 435 

GAC AAA GCT AAG CAA GTC CAA TGG ACA AAT GTG TAC CTG GAT GCA TTG 
Asp Lys Ala Lys Gin Val Gin Trp Thr Asn Val Tyr Leu Asp Ala Leu 
440 445 450 

GAA CTG GGA CTT GTC ATT ACT GGA ACT CTT CCG GTC TTC AAC ATA ACC 
Glu Leu Gly Leu Val He Thr Gly Thr Leu Pro Val Phe Asn He Thr 
455 460 465 470 

GGC CAA TTT GAA AAT AAG ACA AAC TTA AAG AAC CAG CTG ATT CTT GGT 
Glv Gin Phe Glu Asn Lys Thr Asn Leu Lys Asn Gin Leu He Leu Gly 
475 480 485 

GTG ATG GGA GTA GAT GTG TCT TTG GAA GAT ATT AAA AGA CTG ACA CCA 
Val Met Gly Val Asp Val Ser Leu Glu Asp He Lys Arg Leu Thr Pro 
490 495 500 

CGT TTT ACA CTG TGC CCC AAT GGG TAT TAC TTT GCA ATC GAT CCT AAT 
Arg Phe Thr Leu Cys Pro Asn Gly Tyr Tyr Phe Ala He Asp Pro Asn 
505 510 515 

GGT TAT GTT TTA TTA CAT CCA AAT CTT CAG CCA AAG CCT ATT GGT GTA 
Glv Tyr Val Leu Leu His Pro Asn Leu Gin Pro Lys Pro He Gly Val 
520 525 530 
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GGT ATA CCA ACA ATT AAT TTA AGA AAA AGG AGA CCC AAT ATC CAG AAC 1684 
Gly lie Pro Thr He Asn Leu Arg Lys Arg Arg Pro Asn He Gin Asn 
535 540 545 550 

CCC AAA TCT CAG GAG CCA GTA ACA TTG GAT TTC CTT GAT GCA GAG TTA 1732 
Pro Lys Ser Gin Glu Pro Val Thr Leu Asp Phe Leu Asp Ala Glu Leu 
555 560 565 

GAG AAT GAT ATT AAA GTG GAG ATT CGA AAT AAG ATG ATT GAT GGG GAA 1780 
Glu Asn Asp He Lys Val Glu He Arg Asn Lys Met He Asp Gly Glu 
570 575 580 

AGT GGA GAA AAA ACA TTC AGA ACT CTG GTT AAA TCT CAA GAT GAG AGA 1828 
Ser Gly Glu Lys Thr Phe Arg Thr Leu Val Lys Ser Gin Asp Glu Arg 
585 590 595 

TAT ATT GAC AAA GGA AAC AGG ACA TAC ACA TGG ACA CCT GTC AAT GGC 1876 
Tyr He Asp Lys Gly Asn Arg Thr Tyr Thr Trp Thr Pro Val Asn Gly 
600 605 610 

ACA GAT TAC AGT TTG GCC TTG GTA TTA CCA ACC TAC AGT TTT TAC TAT 1924 
Thr Asp Tyr Ser Leu Ala Leu Val Leu Pro Thr Tyr Ser Phe Tyr Tyr 
615 620 625 630 

ATA AAA GCC AAA CTA GAA GAG ACA ATA ACT CAG GCC AGA TAT TCG GAA 1972 
He Lys Ala Lys Leu Glu Glu Thr He Thr Gin Ala Arg Tyr Ser Glu 
635 640 645 

ACC CTG AAG CCA GAT AAT TTT GAA GAA TCT GGC TAT ACA TTC ATA GCA 2020 
Thr Leu Lys Pro Asp Asn Phe Glu Glu Ser Gly Tyr Thr Phe lie Ala 
650 655 660 

CCA AGA GAT TAC TGC AAT GAC CTG AAA ATA TCG GAT AAT AAC ACT GAA 2068 
Pro Arg Asp Tyr Cys Asn Asp Leu Lys He Ser Asp Asn Asn Thr Glu 
665 670 675 

TTT CTT TTA AAT TTC AAC GAG TTT ATT GAT AGA AAA ACT CCA AAC AAC 2116 
Phe Leu Leu Asn Phe Asn Glu Phe He Asp Arg Lys Thr Pro Asn Asn 
680 685 690 

CCA TCA TGT AAC GCG GAT TTG ATT AAT AGA GTC TTG CTT GAT GCA GGC 2164 
Pro Ser Cys Asn Ala Asp Leu He Asn Arg Val Leu Leu Asp Ala Gly 
695 700 705 710 

TTT ACA AAT GAA CTT GTC CAA AAT TAC TGG AGT AAG CAG AAA AAT ATC 2212 
Phe Thr Asn Glu Leu Val Gin Asn Tyr Trp Ser Lys Gin Lys Asn He 
715 720 725 

AAG GGA GTG AAA GCA CGA TTT GTT GTG ACT GAT GGT GGG ATT ACC AGA 2260 
Lys Gly Val Lys Ala Arg Phe Val Val Thr Asp Gly Gly He Thr Arg 
730 735 740 

GTT TAT CCC AAA GAG GCT GGA GAA AAT TGG CAA GAA AAC CCA GAG ACA 2308 
Val Tyr Pro Lys Glu Ala Gly Glu Asn Trp Gin Glu Asn Pro Glu Thr 
745 750 755 
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TAT GAG GAC AGC TTC TAT AAA AGG AGC CTA GAT ART GAT AAC TAT GTT 
Tyr Glu Asp Ser Phe Tyr Lys Arg Ser Leu Asp Asn Asp Asn Tyr Val 
760 765 770 

TTC ACT GCT CCC TAC TTT AAC AAA AGT GGA CCT GGT GCC TAT GAA TCG 
Phe Thr Ala Pro Tyr Phe Asn Lys Ser Gly Pro Gly Ala Tyr Glu Ser 
775 7B0 785 790 

GGC ATT ATG GTA AGC AAA GCT GTA GAA ATA TAT ATT CAA GGG AAA CTT 
Gly He Met Val Ser Lys Ala Val Glu He Tyr He Gin Gly Lys Leu 
795 800 805 

CTT AAA CCT GCA GTT GTT GGA ATT AAA ATT GAT GTA AAT TCC TGG ATA 
Leu Lys Pro Ala Val Val Gly He Lys He Asp Val Asn Ser Trp He 
810 815 820 

GAG AAT TTC ACC AAA ACC TCA ATC AGA GAT CCG TGT GCT GGT CCA GTT 
Glu Asn Phe Thr Lys Thr Ser He Arg Asp Pro Cys Ala Gly Pro Val 
825 830 835 

TGT GAC TGC AAA AGA AAC AGT GAC GTA ATG GAT TGT GTG ATT CTG GAT 
Cys Asp Cys Lys Arg Asn Ser Asp Val Met Asp Cys Val He Leu Asp 
840 845 850 

GAT GGT GGG TTT CTT CTG ATG GCA AAT CAT GAT GAT TAT ACT AAT CAG 
Asp Gly Gly Phe Leu Leu Met Ala Asn His Asp Asp Tyr Thr Asn Gin 
,855 860 865 870 

ATT GGA AGA TTT TTT GGA GAG ATT GAT CCC AGC TTG ATG AGA CAC CTG 
He Gly Arg Phe Phe Gly Glu He Asp Pro Ser Leu Met Arg His Leu 
875 880 885 

GTT AAT ATA TCA GTT TAT GCT TTT AAC AAA TCT TAT GAT TAT CAG TCA 
Val Asn He Ser Val Tyr Ala Phe Asn Lys Ser Tyr Asp Tyr Gin Ser 
890 895 900 

GTA TGT GAG CCC GGT GCT GCA CCA AAA CAA GGA GCA GGA CAT CGC TCA 
Val Cys Glu Pro Gly Ala Ala Pro Lys Gin Gly Ala Gly His Arg Ser 
905 910 915 

GCA TAT GTG CCA TCA GTA GCA GAC ATA TTA CAA ATT GGC TGG TGG GCC 
Ala Tyr Val Pro Ser Val Ala Asp He Leu Gin He Gly Trp Trp Ala 
920 925 930 

ACT GCT GCT GCC TGG TCT ATT CTA CAG CAG TTT CTC TTG AGT TTG ACC 
Thr Ala Ala Ala Trp Ser He Leu Gin Gin Phe Leu Leu Ser Leu Thr 
935 940 945 950 

TTT CCA CGA CTC CTT GAG GCA GTT GAG ATG GAG GAT GAT GAC TTC ACG 
Phe Pro Arq Leu Leu Glu Ala Val Glu Met Glu Asp Asp Asp Phe Thr 
955 9G0 965 

GCC TCC CTG TCC AAG CAG AGC TGC ATT ACT GAA CAA ACC CAG TAT TTC 
Ala Ser Leu Ser Lys Gin Ser Cys He Thr Glu Gin Thr Gin Tyr Phe 
970 975 980 
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TTC GAT AAC GAC AGT AAA TCA TTC AGT GGT GTA TTA GAC TGT GGA AAC 3028 
Phe Asp Asn Asp Ser Lys Ser Phe Ser Gly Val Leu Asp Cys Gly Asn 
985 990 995 

TGT TCC AGA ATC TTT CAT GGA GAA AAG CTT ATG AAC ACC AAC TTA ATA 3 076 

Cys ser Arg He Phe His Gly Glu Lys Leu Met Asn Thr Asn Leu He 
1000 1005 1010 

TTC ATA ATG GTT GAG AGC AAA GGG ACA TGT CCA TGT GAC ACA CGA CTG 3124 
Pne lie Met Val Glu Ser Lys Gly Thr Cys Pro Cys Asp Thr Arg Leu 
1015 1020 1025 1030 

CTC ATA CAA GCG GAG CAG ACT TCT GAC GGT CCA AAT CCT TGT GAC ATG 3172 
Leu lie Gin Ala Glu Gin Thr Ser Asp Gly Pro Asn Pro Cys Asp Met 
1035 1040 1045 

GTT AAG CAA CCT AGA TAC CGA AAA GGG CCT GAT GTC TGC TTT GAT AAC 3220 
Val Lys Gin Pro Arg Tyr Arg Lys Gly Pro Asp Val Cys Phe Asp Asn 
1050 1055 1060 

AAT GTC TTG GAG GAT TAT ACT GAC TGT GGT GGT GTT TCT GGA TTA AAT 3268 
Asn Val Leu Glu Asp Tyr Thr Asp Cys Gly Gly Val Ser Gly Leu Asn 
1065 1070 1075 

CCC TCC CTG TGG TAT ATC ATT GGA ATC CAG TTT CTA CTA CTT TGG CTG 3316 
Pro Ser Leu Trp Tyr He He Gly He Gin Phe Leu Leu Leu Trp Leu 
1080 1085 1090 

GTA TCT GGC AGC ACA CAC CGG CTG TTA TGACCTTCTA AAAACCAAAT 3363 
Val Ser Gly Ser Thr His Arg Leu Leu 
1095 lioo 

CTGCATAGTT AAACTCCAGA CCCTGCCAAA ACATGAGCCC TGCCCTCAAT TACAGTAACG 3423 

TAGGGTCAGC TATAAAATCA GACAAACATT AGCTGGGCCT GTTCCATGGC ATAACACTAA 34 83 

GGCGCAGACT CCTAAGGCAC CCACTGGCTG CATGTCAGGG TGTCAGATCC TTAAACGTGT 3543 

GTGAATGCTG CATCATCTAT GTGTAACATC AAAGCAAAAT CCTATACGTG TCCTCTATTG 3603 

GAAAATTTGG GCGTTTGTTG TTGCATTGTT GGT 3636 



INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3585 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 35.. 3295 
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(D) OTHER INFORMATION: /standard_name= "Alpha-2c" 

(ix) FEATURE: 

(A) NAME/KEY: 5'UTR 

(B) LOCATION: 1..34 

(ix) FEATURE: 

(A) NAME/KEY: 3'UTR 

(B) LOCATION : 3296.-3585 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:30: 

GCGGGGGAGG GGG CATTG AT CTTCGATCGC GAAG ATG GCT GCT GGC TGC CTG 

Met Ala Ala Gly cys ljeu 
1 5 

CTG GCC TTG ACT CTG ACA CTT TTC CAA TCT TTG CTC ATC GGC CCC TCG 
Leu Ala Leu Thr Leu Thr Leu Phe Gin Ser Leu Leu lie Gly Pro Ser 
10 15 20 

TCG GAG GAG CCG TTC CCT TCG GCC GTC ACT ATC AAA TCA TGG GTG GAT 
Ser Glu Glu Pro Phe Pro Ser Ala Val Thr He Lys Ser Trp Val Asp 
25 30 35 

AAG ATG CAA GAA GAC CTT GTC ACA CTG GCA AAA ACA GCA AGT GGA GTC 
Lys Met Gin Glu Asp Leu Val Thr Leu Ala Lys Thr Ala Ser Gly val 
40 45 50 

AAT CAG CTT GTT GAT ATT TAT GAG AAA TAT CAA GAT TTG TAT ACT GTG 
Asn Gin Leu Val Asp He Tyr Glu Lys Tyr Gin Asp Leu Tyr Thr Val 
55 60 65 

GAA CCA AAT AAT GCA CGC CAG CTG GTA GAA ATT GCA GCC AGG GAT ATT 
gS Pro A^n aS Arg Gin Leu Val Glu He Ala Ala Arg Asp He 

75 80 85 

GAG AAA CTT CTG AGC AAC AGA TCT AAA GCC CTG GTG AGC CTG GCA TTG 
Glu Lys Leu Leu Ser Asn Arg Ser Lys Ala Leu Val Ser Leu Ala Leu 

90 95 AUU 

GAA GCG GAG AAA GTT CAA GCA GCT CAC CAG TGG AGA GAA GAT TTT GCA 
Glu Ala Glu Lys Val Gin Ala Ala His Gin Trp Arg Glu Asp Phe Ala 

105 "° 115 

AGC AAT GAA GTT GTC TAC TAC AAT GCA AAG GAT GAT CTC GAT CCT GAG 
ser Asn Glu Val Val Tyr Tyr Asn Ala Lys Asp Asp Leu Asp Pro Glu 

120 125 130 

AAA AAT GAC AGT GAG CCA GGC AGC CAG AGG ATA AAA CCT GTT TTC ATT 
Lys A^n Sp ser Glu Pro Gly Ser Gin Arg lie Lys Pro Val Phe He 
135 140 

GAA GAT GCT AAT TTT GGA CGA CAA ATA TCT TAT CAG CAC GCA GCA GTC 
Glu Asp Ala Asn Phe Gly Arg Gin He Ser Tyr Gin His Ala Ala va^ 
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CAT ATT CCT ACT GAC ATC TAT GAG GGC TCA ACA ATT GTG TTA AAT GAA 
His He Pro Thr Asp He Tyr Glu Gly Ser Thr He Val Leu Asn Glu 
170 175 180 

CTC AAC TGG ACA AGT GCC TTA GAT GAA GTT TTC AAA AAG AAT CGC GAG 
Leu Asn Trp Thr Ser Ala Leu Asp Glu Val Phe Lys Lys Asn Arg Glu 
185 190 195 

n^ft ^ AC S CT TF* 71,(3 TGG M GTT TTT GGC AGT GCC ACT GGC CTA 

Glu Asp Pro Ser Leu Leu Trp Gin Val Phe Gly Ser Ala Thr Gly Leu 
200 205 210 

GCT CGA TAT TAT CCA GCT TCA CCA TGG GTT GAT AAT AGT AGA ACT CCA 
Ala Arg Tyr Tyr Pro Ala Ser Pro Trp Val Asp Asn Ser Arg tS pS 
215 220 225 230 

tel tT T GAC t CTT 1 AT GAT GTA CGC AGA AGA CCA TGG TAC ATC CAA 

Asn Lys He Asp Leu Tyr Asp Val Arg Arg Arg Pro Trp Tyr He Gin 
235 240 245 

o?, A G ? T GCA ? CT CCT *** GAC ATG 0X7 ATT GTG GTG GAT GTG AGT GGA 
Gly Ala Ala Ser Pro Lys Asp Met Leu He Leu Val Asp Val Ser Gly 
25 ° 255 260 

ACT GTT AGT GGA TTG ACA CTT AAA CTG ATC CGA ACA TCT GTC TCC GAA 
Ser Val Ser Gly Leu Thr Leu Lys Leu He Arg Thr Ser Val Ser Glu 
265 270 275 

ATG TTA GAA ACC CTC TCA GAT GAT GAT TTC GTG AAT GTA GCT TCA TTT 
Met Leu Glu Thr Leu Ser Asp Asp Asp Phe Val Asn Val Ala Ser Phe 
280 285 290 

AAC AGC AAT GCT CAG GAT GTA AGC TGT TTT CAG CAC CTT GTC CAA GCA 
Asn Ser Asn Ala Gin Asp Val Ser Cys Phe Gin His Leu Val Gin Ala 
295 300 305 310 

AAT GTA AGA AAT AAA AAA GTG TTG AAA GAC GCG GTG AAT AAT ATC ACA 
Asn Val Arg Asn Lys Lys Val Leu Lys Asp Ala Val Asn Asn He Thr 
315 320 325 

GCC AAA GGA ATT ACA GAT TAT AAG AAG GGC TTT AGT TTT GCT TTT GAA 
Ala Lys Gly He Thr Asp Tyr Lys Lys Gly Phe Ser Phe Ala Phe Glu 
330 335 340 

CAG CTG CTT AAT TAT AAT GTT TCC AGA GCA AAC TGC AAT AAG ATT ATT 
Gin Leu Leu Asn Tyr Asn Val Ser Arg Ala Asn Cys Asn Lys He He 
3*5 350 355 

ATG CTA TTC ACG GAT GGA GGA GAA GAG AGA GCC CAG GAG ATA TTT AAC 
Met Leu Phe Thr Asp Gly Gly Glu Glu Arg Ala Gin Glu He Phe Asn 
360 365 370 

AAA TAC AAT AAA GAT AAA AAA GTA CGT GTA TTC AGG TTT TCA GTT GGT 
Lys Tyr Asn Lys Asp Lys Lys Val Arg Val Phe Arg Phe Ser Val Gly 
375 380 385 390 
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CAA CAC AAT TAT GAG AGA GGA CCT ATT CAG TGG ATG GCC TGT GAA AAC 
Gin His Asn Tyr Glu Arg Gly Pro He Gin Trp Met Ala Cys Glu Asn 

395 400 405 

AAA GGT TAT TAT TAT GAA ATT CCT TCC ATT GGT GCA ATA AGA ATC AAT 
Lys Gly Tyr Tyr Tyr Glu He Pro Ser He Gly Ala He Arg He Asn 
410 415 420 

ACT CAG GAA TAT TTG GAT GTT TTG GGA AGA CCA ATG GTT TTA GCA GGA 
Thr Gin Glu Tyr Leu Asp Val Leu Gly Arg Pro Met Val Leu Ala Gly 
425 430 435 

GAC AAA GCT AAG CAA GTC CAA TGG ACA AAT GTG TAC CTG GAT GCA TTG 
Asp Lys Ala Lys Gin Val Gin Trp Thr Asn Val Tyr Leu Asp Ala Leu 
440 445 450 

GAA CTG GGA CTT GTC ATT ACT GGA ACT CTT CCG GTC TTC AAC ATA ACC 
Glu Leu Gly Leu Val He Thr Gly Thr Leu Pro Val Phe Asn He Thr 
455 460 465 470 

GGC CAA TTT GAA AAT AAG ACA AAC TTA AAG AAC CAG CTG ATT CTT GGT 
Gly Gin Phe Glu Asn Lys Thr Asn Leu Lys Asn Gin Leu He Leu Gly 
475 480 4B5 

GTG ATG GGA GTA GAT GTG TCT TTG GAA GAT ATT AAA AGA CTG ACA CCA 
Val Met Gly Val Asp Val Ser Leu Glu Asp He Lys Arg Leu Thr Pro 
490 495 500 

CGT TTT ACA CTG TGC CCC AAT GGG TAT TAC TTT GCA ATC GAT CCT AAT 
Arg Phe Thr Leu Cys Pro Asn Gly Tyr Tyr Phe Ala lie Asp Pro Asn 
505 510 515 

GGT TAT GTT TTA TTA CAT CCA AAT CTT CAG CCA AAG GAG CCA GTA ACA 
Gly Tyr Val Leu Leu His Pro Asn Leu Gin Pro Lys Glu Pro Val Thr 



TTG GAT TTC CTT GAT GCA GAG TTA GAG AAT GAT ATT AAA GTG GAG ATT 
Leu Asp Phe Leu Asp Ala Glu Leu Glu Asn Asp He Lys Val Glu 
535 540 =45 

rra AAT AAG ATG ATT GAT GGG GAA AGT GGA GAA AAA ACA TTC AGA ACT 
Sg Hn Ss Se? He Sp Gly Glu Ser Gly Glu Lys Thr Phe Arg Thr 

CTG GTT AAA TCT CAA GAT GAG AGA TAT ATT GAC AAA GGA AAC AGG ACA 
III SS £yt ser SS Asp Glu Arg Tyr He Asp Lys Gly Asn Arg Thr 
570 575 3eu 

TAC ACA TGG ACA CCT GTC AAT GGC ACA GAT TAC AGT TTG GCC TTG GTA 
Ty"r tS ?r? Thr Pro Val Asn Gly Thr Asp Tyr Ser Leu Ala Leu Val 
585 590 595 

£ £ £ Tyr S S Tyr £ S SSS S Si 5S 

600 * 605 610 
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rJt ^ ^ GC ° ^ GA TCA *** AAG GGC ^ ATG AAG GAT TCG GAA ACC 
lie Thr Gin Ala Arg Ser Lys Lys Gly Lys Met Lys Asp Ser Glu Thr 
615 620 625 630 

CTG AAG CCA GAT AAT TTT GAA GAA TCT GGC TAT ACA TTC ATA GCA CCA 
Leu Lys Pro Asp Asn Phe Glu Glu ser Gly Tyr Thr Phe He Ala Pro 
635 640 645 

AGA GAT TAC TGC AAT GAC CTG AAA ATA TCG GAT AAT AAC ACT GAA TTT 
Arg Asp Tyr Cys Asn Asp Leu Lys He Ser Asp Asn Asn Thr Glu Phe 
650 655 660 

fT? ™* ^3 " C f^C GAG TTT ATT GAT AGA AAA ACT CCA AAC AAC CCA 
Leu Leu Asn Phe Asn Glu Phe He Asp Arg Lys Thr Pro Asn Asn Pro 
665 670 675 

TCA TGT AAC GCG GAT TTG ATT AAT AGA GTC TTG CTT GAT GCA GGC TTT 
SfS Ala Leu Ile Asn Arg Val Leu Leu Asp Ala Gly Phe 

680 685 690 

Thr r** P T ?, T ? AAT TAC TGG ACT AAG CAG AAA AAT ATC AAG 

Thr Asn Glu Leu Val Gin Asn Tyr Trp Ser Lys Gin Lys Asn Ile Lys 
695 700 705 710 

GGA GTG AAA GCA CGA TTT GTT GTG ACT GAT GGT GGG ATT ACC AGA GTT 
Gly Val Lys Ala Arg Phe Val Val Thr Asp Gly Gly Ile Thr Arg Val 
715 720 725 

J^! GAG GCT GGA GAA AAT TGG CAA GAA AAC CCA GAG ACA TAT 

Tyr Pro Lys Glu Ala Gly Glu Asn Trp Gin Glu Asn Pro Glu Thr Tvr 
730 735 740 

GAG GAC AGC TTC TAT AAA AGG AGC CTA GAT AAT GAT AAC TAT GTT TTC 
Glu Asp Ser Phe Tyr Lys Arg Ser Leu Asp Asn Asp Asn Tyr Val Phe 
745 750 755 

ACT GCT CCC TAC TTT AAC AAA AGT GGA CCT GGT GCC TAT GAA TCG GGC 
Thr Ala Pro Tyr Phe Asn Lys Ser Gly Pro Gly Ala Tyr Glu Ser Gly 
760 765 770 

ATT ATG GTA AGC AAA GCT GTA GAA ATA TAT ATT CAA GGG AAA CTT CTT 
Ile Met Val Ser Lys Ala Val Glu Ile Tyr Ile Gin Gly Lys Leu Leu 
775 780 785 " 790 

AAA CCT GCA GTT GTT GGA ATT AAA ATT GAT GTA AAT TCC TGG ATA GAG 
Lys Pro Ala Val Val Gly Ile Lys Ile Asp Val Asn Ser Trp Ile Glu 
795 800 805 

AAT TTC ACC AAA ACC TCA ATC AGA GAT CCG TGT GCT GGT CCA GTT TGT 
Asn Phe Thr Lys Thr Ser Ile Arg Asp Pro Cys Ala Gly Pro Val Cys 
810 815 820 

GAC TGC AAA AGA AAC AGT GAC GTA ATG GAT TGT GTG ATT CTG GAT GAT 
Asp Cys Lys Arg Asn Ser Asp Val Met Asp Cys Val Ile Leu Asp Asp 
825 830 835 
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GGT GGG TTT CTT CTG ATG GCA AAT CAT GAT GAT TAT ACT AAT CAG ATT 
Gly Gly Phe Leu Leu Met Ala Asn His Asp Asp Tyr Thr Asn Gin He 
840 845 850 

GGA AGA TTT TTT GGA GAG ATT GAT CCC AGC TTG ATG AGA CAC CTG GTT 
Gly Arg Phe Phe Gly Glu He Asp Pro Ser Leu Met Arg His Leu Val 
855 860 " 865 «™ 



870 



AAT ATA TCA GTT TAT GCT TTT AAC AAA TCT TAT GAT TAT CAG TCA GTA 
Asn He Ser Val Tyr Ala Phe Asn Lys Ser Tyr Asp Tyr Gin Ser Val 

875 880 885 

TGT GAG CCC GGT GCT GCA CCA AAA CAA GGA GCA GGA CAT CGC TCA GCA 
Cys Glu Pro Gly Ala Ala Pro Lys Gin Gly Ala Gly His Arg Ser Ala 
890 895 900 

TAT GTG CCA TCA GTA GCA GAC ATA TTA CAA ATT GGC TGG TGG GCC ACT 
Tyr Val Pro Ser Val Ala Asp He Leu Gin He Gly Trp Trp Ala Thr 
905 910 915 

GCT GCT GCC TGG TCT ATT CTA CAG CAG TTT CTC TTG AGT TTG ACC TTT 
Ala Ala Ala Trp Ser He Leu Gin Gin Phe Leu Leu Ser Leu Thr Phe 
920 * 925 930 

CCA CGA CTC CTT GAG GCA GTT GAG ATG GAG GAT GAT GAC TTC ACG GCC 
Pro Arg Leu Leu Glu Ala Val Glu Met Glu Asp Asp Asp Phe Thr Ala 
935 940 945 950 

TCC CTG TCC AAG CAG AGC TGC ATT ACT GAA CAA ACC CAG TAT TTC TTC 
Ser Leu Ser Lys Gin Ser Cys He Thr Glu Gin Thr Gin Tyr Phe Phe 
955 960 965 

GAT AAC GAC AGT AAA TCA TTC AGT GGT GTA TTA GAC TGT GGA AAC TGT 
Asp Asn Asp Ser Lys Ser Phe Ser Gly Val Leu Asp Cys Gly Asn Cys 



TCC AGA ATC TTT CAT GGA GAA AAG CTT ATG AAC ACC AAC TTA ATA TTC 
Ser Arg He Phe His Gly Glu Lys Leu Met Asn Thr Asn Leu He Phe 
985 990 995 

ATA ATG GTT GAG AGC AAA GGG ACA TGT CCA TGT GAC ACA CGA CTG CTC 
He Met Val Glu Ser Lys Gly Thr Cys Pro Cys Asp Thr Arg Leu Leu 
1000 1005 1010 

ATA CAA GCG GAG CAG ACT TCT GAC GGT CCA AAT CCT TGT GAC ATG GTT 
He Gin Ala Glu Gin Thr Ser Asp Gly Pro Asn Pro Cys Asp Met Val 
1015 1020 1025 1030 

AAG CAA CCT AGA TAC CGA AAA GGG CCT GAT GTC TGC TTT GAT AAC AAT 
Lys Gin Pro Arg Tyr Arg Lys Gly Pro Asp^Val Cys Phe Asp Asn^Asn 



1035 



1040 



GTC TTG GAG GAT TAT ACT GAC TGT GGT GGT GTT TCT GGA TTA AAT CCC 
val Leu Glu Asp Tyr Thr Asp Cys Gly Gly Val Ser Gly Leu Asn Pro 
1050 1055 1060 

TCC CTG TGG TAT ATC ATT GGA ATC CAG TTT CTA CTA CTT TGG CTG GTA 
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Ser Leu Trp Tyr He He Gly He Gin Phe Leu Leu Leu Trp Leu Val 
• LObb 1070 1075 

Ser J£ £ £s £g Su S TGACCTTCTA ^CAAAT CTG CATAGTT 

1080 i 08 5 

AAACTCCAGA CCCTGCCAAA ACATGAGCCC TGCCCTCAAT TACAGTAACG TAGGGTCAGC 
TATAAAATCA GACAAACATT AGCTGGGCCT GTTCCATGGC ATAACACTAA GGCGCAGACT 
CCTAAGGCAC CCACTGGCTG CATGTCAGGG TGTCAGATCC TTAAACGTGT GTGAATGCTG 
CATCATCTAT GTGTAACATC AAAGCAAAAT CCTATACGTG TCCTCTATTG GAAAATTTGG 
GCGTTTGTTG TTGCATTGTT GGT 
(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 564 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 
(BJ LOCATION: 35.. 3374 (01625 to 1539 & O1908 to 1928) 
(D) OTHER INFORMATION: /standard_name= »Alpha-2d» 

(ix) FEATURE: 

(A) NAME/KEY: 5'UTR 

(B) LOCATION: 1..34 

(ix) FEATURE: 

(A) NAME/KEY: 3'UTR 

(B) LOCATION: 3375.. 3565 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

GCGGGGGAGG GGGCATTGAT CTTCGATCGC GAAG ATG GCT GCT GGC TGC CTG 

Met Ala Ala Gly Cys Leu 
1 5 

CTG GCC TTG ACT CTG ACA CTT TTC CAA TCT TTG CTC ATC GGC CCC TCG 
Leu Ala Leu Thr Leu Thr Leu Phe Gin Ser Leu Leu He Gly Pro Ser 
10 15 20 

TCG GAG GAG CCG TTC CCT TCG GCC GTC ACT ATC AAA TCA TGG GTG GAT 
Ser Glu Glu Pro Phe Pro Ser Ala Val Thr He Lys Ser Trp Val Asd 
25 30 35 

AAG ATG CAA GAA GAC CTT GTC ACA CTG GCA AAA ACA GCA AGT GGA GTC 
Lys Met Gin Glu Asp Leu Val Thr Leu Ala Lys Thr Ala Ser Gly Val 



3382 
3442 
3505 
3562 
3585 



WO 95/04822 



PCT/US94/09230 



-243- 



AAT CAG CTT GTT GAT ATT TAT GAG AAA TAT CAA GAT TTG TAT ACT GTG 
Jsn Gin Leu Val Asp He Tyr Glu Lys Tyr Gin Asp Leu Tyr Thr Val 
55 60 65 70 

GAA CCA AAT AAT GCA CGC CAG CTG GTA GAA ATT GCA GCC AGG GAT ATT 
Glu Pro Asn Asn Ala Arg Gin Leu Val Glu He Ala Ala Arg Asp lie 
75 80 85 

GAG AAA CTT CTG AGC AAC AGA TCT AAA GCC CTG GTG AGC CTG GCA TTG 
Glu Lys Leu Leu Ser Asn Arg Ser Lys Ala Leu Val Ser Leu Ala Leu 
90 95 100 

GAA GCG GAG AAA GTT CAA GCA GCT CAC CAG TGG AGA GAA GAT TTT GCA 
Glu Ala Glu Lys Val Gin Ala Ala His Gin Trp Arg Glu Asp Phe Ala 

105 HO ll 5 

AGC AAT GAA GTT GTC TAC TAC AAT GCA AAG GAT GAT CTC GAT CCT GAG 
sir Asn Glu val Val Tyr Tyr Asn Ala Lys Asp Asp Leu Asp Pro Glu 

120 125 13° 

AAA AAT GAC AGT GAG CCA GGC AGC CAG AGG ATA AAA CCT GTT TTC ATT 
Lys Asn Asp Ser Glu Pro Gly Ser Gin Arg He Lys Pro Val Phe lie 
135 140 145 150 

GAA GAT GCT AAT TTT GGA CGA CAA ATA TCT TAT CAG CAC GCA GCA GTC 
Glu Asp Ala Asn Phe Gly Arg Gin He Ser Tyr Gin His Ala Ala Val 
155 160 ■•-« 

CAT ATT CCT ACT GAC ATC TAT GAG GGC TCA ACA ATT GTG TTA AAT GAA 
His He Pro ¥nr Asp He Tyr Glu Gly Ser Thr He Val Leu Asn Glu 
170 175 180 

CTC AAC TGG ACA AGT GCC TTA GAT GAA GTT TTC AAA AAG AAT CGC GAG 
Leu Asn Trp Thr Ser Ala Leu Asp Glu Val Phe Lys Lys Asn Arg Glu 

185 I 90 195 

GAA GAC CCT TCA TTA TTG TGG CAG GTT TTT GGC AGT GCC ACT GGC CTA 
gJ5 As"p pS sS Leu Leu Trp Gin Val Phe Gly Ser Ala Thr Gly Leu 

200 205 210 

GCT CGA TAT TAT CCA GCT TCA CCA TGG GTT GAT AAT AGT AGA ACT CCA 
SI Arg Tyr Tyr Pro Ala Ser Pro Trp Val Asp Asn Ser Arg Thr Pro 
215 220 225 

AAT AAG ATT GAC CTT TAT GAT GTA CGC AGA AGA CCA TGG TAC ATC CAA 
Sn iys lie Asp Leu Tyr Asp Val Arg Arg Arg Pro Trp Tyr He Gin 

235 240 ^ 

GGA GCT GCA TCT CCT AAA GAC ATG CTT ATT CTG GTG GAT GTG AGT GGA 
Gly Ala Ala Ser Pro Lys Asp Met Leu He Leu Val Asp Val Ser Gly 

250 255 
AGT GTT AGT GGA TTG ACA CTT AAA CTG ATC CGA ACA TCT GTC TCC GAA 
Ser Val Ser Gly Leu Thr Leu Lys Leu He Arg Thr Ser Val ser nxu 
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ATG TTA GAA ACC CTC TCA GAT GAT GAT TTC GTG AAT GTA GCT TCA TTT 
U Thr Leu Ser As P As P phe Val A sn Val Ala Ser Phe 

280 285 290 

AAC AGC AAT GCT CAG GAT GTA AGC TGT TTT CAG CAC CTT GTC CAA GCA 
Asn Ser Asn Ala Gin Asp Val Ser Cys Phe Gin His Leu Val Gin Ala 
295 300 305 310 

AAT GTA AGA AAT AAA AAA GTG TTG AAA GAC GCG GTG AAT AAT ATC ACA 
Asn Val Arg Asn Lys Lys Val Leu Lys Asp Ala Val Asn Asn He Thr 

315 320 325 

GCC AAA GGA ATT ACA GAT TAT AAG AAG GGC TTT AGT TTT GCT TTT GAA 
Ala Lys Gly lie Thr Asp Tyr Lys Lys Gly Phe Ser Phe Ala Phe Glu 
330 335 340 

CAG CTG CTT AAT TAT AAT GTT TCC AGA GCA AAC TGC AAT AAG ATT ATT 
Gin Leu Leu Asn Tyr Asn Val Ser Arg Ala Asn Cys Asn Lys He lie 
345 350 355 

ATG CTA TTC ACG GAT GGA GGA GAA GAG AGA GCC CAG GAG ATA TTT AAC 
Met Leu Phe Thr Asp Gly Gly Glu Glu Arg Ala Gin Glu lie i>ne iJS 
360 365 370 

^ *^ GTA CGT GTA TTC AGG TTT TCA GTT GGT 

" Gly 



T ,,_ - — ««» ill. ftUO 111 lUi U1T 

Lys Tyr Asn Lys Asp Lys Lys Val Arg Val Phe Arg Phe Ser Val 
375 380 



385 



CAA CAC AAT TAT GAG AGA GGA CCT ATT CAG TGG ATG GCC TGT GAA AAC 
Gin His Asn Tyr Glu Arg Gly Pro lie Gin Trp Met Ala Cys Glu Asn 

395 400 405 

AAA GGT TAT TAT TAT GAA ATT CCT TCC ATT GGT GCA ATA AGA ATC AAT 
Lys Gly Tyr Tyr Tyr Glu He Pro Ser He Gly Ala He Arg He Asn 
410 415 420 

ACT CAG GAA TAT TTG GAT GTT TTG GGA AGA CCA ATG GTT TTA GCA GGA 
Thr Gin Glu Tyr Leu Asp Val Leu Gly Arg Pro Met Val Leu Ala Gly 
425 430 435 * 

GAC AAA GCT AAG CAA GTC CAA TGG ACA AAT GTG TAC CTG GAT GCA TTG 
Asp Lys Ala Lys Gin Val Gin Trp Thr Asn Val Tyr Leu Asp Ala Leu 
4 40 445 450 

GAA CTG GGA CTT GTC ATT ACT GGA ACT CTT CCG GTC TTC AAC ATA ACC 
Glu Leu Gly Leu Val He Thr Gly Thr Leu Pro Val Phe Asn He Thr 
455 450 465 470 

GGC CAA TTT GAA AAT AAG ACA AAC TTA AAG AAC CAG CTG ATT CTT GGT 
Gly Gin Phe Glu Asn Lys Thr Asn Leu Lys Asn Gin Leu He Leu Gly 
475 480 485 

GTG ATG GGA GTA GAT GTG TCT TTG GAA GAT ATT AAA AGA CTG ACA CCA 
Val Met Gly Val Asp Val Ser Leu Glu Asp He Lys Arg Leu Thr Pro 
490 495 500 
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CGT TTT ACA CTG TGC CCC AAT GGG TAT TAC TTT GCA ATC GAT CCT AAT 
Arg Phe Thr Leu Cys Pro Asn Gly Tyr Tyr Phe Ala lie Asp Pro Asn 
505 510 515 

GGT TAT GTT TTA TTA CAT CCA AAT CTT CAG CCA AAG GAG CCA GTA ACA 
Gly Tyr Val Leu Leu His Pro Asn Leu Gin Pro Lys Glu Pro Val Thr 



TTG GAT TTC CTT GAT GCA GAG TTA GAG AAT GAT ATT AAA GTG GAG ATT 
Leu Asp Phe Leu Asp Ala Glu Leu Glu Asn Asp He Lys Val Glu lie 
535 540 545 550 

CGA AAT AAG ATG ATT GAT GGG GAA AGT GGA GAA AAA ACA TTC AGA ACT 
Arg Asn Lys Met He Asp Gly Glu Ser Gly Glu Lys Thr Phe Arg Thr 
555 560 565 

CTG GTT AAA TCT CAA GAT GAG AGA TAT ATT GAC AAA GGA AAC AGG ACA 
Leu Val Lys Ser Gin Asp Glu Arg Tyr He Asp Lys Gly Asn Arg Thr 
570 575 580 

TAC ACA TGG ACA CCT GTC AAT GGC ACA GAT TAC AGT TTG GCC TTG GTA 
Tyr Thr Trp Thr Pro Val Asn Gly Thr Asp Tyr Ser Leu Ala Leu Val 
585 590 595 

TTA CCA ACC TAC AGT TTT TAC TAT ATA AAA GCC AAA CTA GAA GAG ACA 
Leu Pro Thr Tyr Ser Phe Tyr Tyr He Lys Ala Lys Leu Glu Glu Thr 
600 605 610 

ATA ACT CAG GCC AGA TAT TCG GAA ACC CTG AAG CCA GAT AAT TTT GAA 
He Thr Gin Ala Arg Tyr Ser Glu Thr Leu Lys Pro Asp Asn Phe Glu 
615 620 625 630 

GAA TCT GGC TAT ACA TTC ATA GCA CCA AGA GAT TAC TGC AAT GAC CTG 
Glu Ser Gly Tyr Thr Phe He Ala Pro Arg Asp Tyr Cys Asn Asp Leu 
635 640 645 

AAA ATA TCG GAT AAT AAC ACT GAA TTT CTT TTA AAT TTC AAC GAG TTT 
Lys He Ser Asp Asn Asn Thr Glu Phe Leu Leu Asn Phe Asn Glu Phe 
650 655 660 

ATT GAT AGA AAA ACT CCA AAC AAC CCA TCA TGT AAC GCG GAT TTG ATT 
He Asp Arg Lys Thr Pro Asn Asn Pro Ser Cys Asn Ala Asp Leu He 

665 670 675 

AAT AGA GTC TTG CTT GAT GCA GGC TTT ACA AAT GAA CTT GTC CAA AAT 
A^n £rg vll Leu ill Asp Ala Gly Phe Thr Asn Glu Leu Val Gin Asn 

680 685 69° 

TAC TGG AGT AAG CAG AAA AAT ATC AAG GGA GTG AAA GCA CGA TTT GTT 
Tyr Trp Ser Lys Gin Lys Asn He Lys Gly val Lys Ala Arg Phe Val 
695 700 705 

GTG ACT GAT GGT GGG ATT ACC AGA GTT TAT CCC AAA GAG GCT GGA GAA 
val Thr Asp Gly Gly He Thr Arg Val Tyr Pro Lys Glu Ala Gly Glu 
715 720 
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A^n ^ ^ C CCA GAG ACA TAT GAG GAC AGC TTC TAT AAA AGG 

Asn Trp Gin Glu Asn Pro Glu Thr Tyr Glu Asp Ser Phe Tyr Lys Arg 

sir f^ A f T t** GAT AAC TAT GTT ACT GCT CCC TAC TTT AAC AAA 

Ser Leu Asp Asn Asp Asn Tyr Val Phe Thr Ala Pro Tyr Phe Asn Lys 
745 750 755 

o G * GGA GCT G ? T GCC TAT GAA TCG GGC ATT ATG GTA AGC AAA GCT GTA 
Ser Gly Pro Gly Ala Tyr Glu Ser Gly lie Met Val Ser Lys Ala Val 
760 765 77 0 

ni A ?I ^ GGG *** CTT CTT ■kAA CCT GCA GTT GTT GGA ATT 

Glu lie Tyr He Gin Gly Lys Leu Leu Lys Pro Ala Val Val Gly He 
775 780 785 * 

AAA ATT GAT GTA AAT TCC TGG ATA GAG AAT TTC ACC AAA ACC TCA A-rr 
Lys He Asp Val Asn Ser Trp He Glu Asn S J£ JJJ i£ sS He 

795 800 805 

AGA GAT CCG TGT GCT GGT CCA GTT TGT GAC TGC AAA AGA AAC AST Gar 
Arg Asp Pro Cys Ala Gly Pro Val Cys Asp Cys Lys Arg £n ter 2p 
810 815 820 

Sit S ™ £f T G A ? T CTG GAT GGT GGG TTT CTT CTG ATG GCA 

Val Met Asp cys Val lie Leu Asp Asp Gly Gly Phe Leu Leu Met Ala 
825 830 B35 

A^n ? AT » AT I AT A 5 T CAG ATT GGA AGA TTT TTT GGA GAG ATT 

fi^n P Thr Asn Gln Ile G1 y ^9 ph * p ^e Gly Glu lie 

840 845 850 

GAT CCC AGC TTG ATG AGA CAC CTG GTT AAT ATA TCA GTT TAT GCT TTT 
Asp Pro Ser Leu Met Arg His Leu Val Asn lie Ser Val Tyr Ala pS 
855 860 865 870 

AAC AAA TCT TAT GAT TAT CAG TCA GTA TGT GAG CCC GGT GCT GCA CCA 
Asn Lys Ser Tyr Asp Tyr Gin Ser Val Cys Glu Pro Gly Ala Ala Pro 
875 880 885 

GAA G ? A GCA GGA CAT CGG TCA GCA TAT GTG CCA TCA GTA GCA GAC 
Lys Gin Gly Ala Gly His Arg Ser Ala Tyr Val Pro Ser Val Ala Asp 
890 895 900 

ATA TTA CAA ATT GGC TGG TGG GCC ACT GCT GCT GCC TGG TCT ATT CTA 
Ile Leu Gin He Gly Trp Trp Ala Thr Ala Ala Ala Trp Ser He Leu 
905 gio 91 % 

CAG CAG TTT CTC TTG AGT TTG ACC TTT CCA CGA CTC CTT GAG GCA GTT 
Gin Gin Phe Leu Leu Ser Leu Thr Phe Pro Arg Leu Leu Glu Ala Val 
920 925 930 

GAG ATG GAG GAT GAT GAC TTC ACG GCC TCC CTG TCC AAG CAG AGC TGC 
Glu Met Glu Asp Asp Asp Phe Thr Ala Ser Leu Ser Lys Gin Ser Cvs 
935 940 945 ' g | 0 

ATT ACT GAA CAA ACC CAG TAT TTC TTC GAT AAC GAC AGT AAA TCA TTC 
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Ile Thr Glu Gin Thr Gin Tyr Phe Phe Asp Asn Asp Ser Lys Ser Phe 
955 960 965 

AGT GGT GTA TTA GAC TGT GGA AAC TGT TCC AGA ATC TTT CAT GGA GAA 29B0 
Ser Gly Val Leu Asp Cys Gly Asn Cys Ser Arg lie Phe His Gly Glu 
970 975 980 

AAG CTT ATG AAC ACC AAC TTA ATA TTC ATA ATG GTT GAG AGC AAA GGG 3028 
Lys Leu Met Asn Thr Asn Leu He Phe He Met val Glu Ser Lys Gly 
985 990 995 

ACA TGT CCA TGT GAC ACA CGA CTG CTC ATA CAA GCG GAG CAG ACT TCT 3076 
Thr Cys Pro Cys Asp Thr Arg Leu Leu He Gin Ala Glu Gin Thr Ser 
1000 1005 1010 

GAC GGT CCA AAT CCT TGT GAC ATG GTT AAG CAA CCT AGA TAC CGA AAA 3124 
Asp Gly Pro Asn Pro Cys Asp Met Val Lys Gin Pro Arg Tyr Arg Lys 
1015 1020 1025 1030 

GGG CCT GAT GTC TGC TTT GAT AAC AAT GTC TTG GAG GAT TAT ACT GAC 3172 
Glv Pro Asp Val Cys Phe Asp Asn Asn Val Leu Glu Asp Tyr Thr Asp 
1035 1040 1045 

TGT GGT GGT GTT TCT GGA TTA AAT CCC TCC CTG TGG TAT ATC ATT GGA 3220 
Cys Gly Gly Val Ser Gly Leu Asn Pro Ser Leu Trp Tyr He He Gly 
1050 * 1055 1060 

ATC CAG TTT CTA CTA CTT TGG CTG GTA TCT GGC AGC ACA CAC CGG CTG 3268 
He Gin Phe Leu Leu Leu Trp Leu Val Ser Gly Ser Thr His Arg Leu 
1065 1070 1075 

TTA TGACCTTCTA AAAACCAAAT CTGCATAGTT AAACTCCAGA CCCTGCCAAA 3321 
Leu 

ACATGAGCCC TGCCCTCAAT TACAGTAACG TAGGGTCAGC TATAAAATCA GACAAACATT 3381 

AGCTGGGCCT GTTCCATGGC ATAACACTAA GGCGCAGACT CCTAAGGCAC CCACTGGCTG 3441 

CATGTCAGGG TGTCAGATCC TTAAACGTGT GTGAATGCTG CATCATCTAT GTGTAACATC 3501 

AAAGCAAAAT CCTATACGTG TCCTCTATTG GAAAATTTGG GCGTTTGTTG TTGCATTGTT 3561 

3564 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3579 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 
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(A) NAME/KEY: CDS 

(B) LOCATION: 35.. 3289 

(D) OTHER INFORMATION: /standard_name= »Alpha-2e" 

(ix) FEATURE: 

(A) NAME/KEY: 5'UTR 

(B) LOCATION: 1..34 

(ix) FEATURE: 

(A) NAME/KEY: 3'UTR 

(B) LOCATION: 3289.. 3579 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

GCGGGGGAGG GGGCATTGAT CTTCGATCGC GAAG ATG GCT GCT GGC TGC CTG 

Met Ala Ala Gly Cys Leu 
1 5 

CTG GCC TTG ACT CTG ACA CTT TTC CAA TCT TTG CTC ATC GGC CCC TCG 
Leu Ala Leu Thr Leu Thr Leu Phe Gin Ser Leu Leu lie Gly Pro Ser 
10 15 20 

TCG GAG GAG CCG TTC CCT TCG GCC GTC ACT ATC AAA TCA TGG GTG GAT 
Ser Glu Glu Pro Phe Pro Ser Ala Val Thr lie Lys Ser Trp Val Asp 
25 30 35 

AAG ATG CAA GAA GAC CTT GTC ACA CTG GCA AAA ACA GCA AGT GGA GTC 
Lys Met Gin Glu Asp Leu Val Thr Leu Ala Lys Thr Ala Ser Gly Val 
40 45 50 

AAT CAG CTT GTT GAT ATT TAT GAG AAA TAT CAA GAT TTG TAT ACT GTG 
Asn Gin Leu Val Asp lie Tyr Glu Lys Tyr Gin Asp Leu Tyr Thr Val 
55 60 65 70 

GAA CCA AAT AAT GCA CGC CAG CTG GTA GAA ATT GCA GCC AGG GAT ATT 
Glu Pro Asn Asn Ala Arg Gin Leu Val Glu lie Ala Ala Arg Asp ' lie 
75 80 85 

GAG AAA CTT CTG AGC AAC AGA TCT AAA GCC CTG GTG AGC CTG GCA TTG 
Glu Lys Leu Leu Ser Asn Arg Ser Lys Ala Leu Val Ser Leu Ala Leu 
90 95 100 

GAA GCG GAG AAA GTT CAA GCA GCT CAC CAG TGG AGA GAA GAT TTT GCA 
Glu Ala Glu Lys Val Gin Ala Ala His Gin Trp Arg Glu Asp Phe Ala 
105 no 115 

AGC AAT GAA GTT GTC TAC TAC AAT GCA AAG GAT GAT CTC GAT CCT GAG 
Ser Asn Glu Val Val Tyr Tyr Asn Ala Lys Asp Asp Leu Asp Pro Glu 
120 125 130 

AAA AAT GAC AGT GAG CCA GGC AGC CAG AGG ATA AAA CCT GTT TTC ATT 
Lys Asn Asp Ser Glu Pro Gly Ser Gin Arg lie Lys Pro Val Phe lie 
1 35 140 145 150 

GAA GAT GCT AAT TTT GGA CGA CAA ATA TCT TAT CAG CAC GCA GCA GTC 
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CAT ATT CCT ACT GAC ATC TAT GAG GGC TCA ACA ATT GTG TTA AAT GAA 
His lie Pro Thr Asp lie Tyr Glu Gly Ser Thr He Val Leu Asn Glu 
170 175 180 

CTC AAC TGG ACA AGT GCC TTA GAT GAA GTT TTC AAA AAG AAT CGC GAG 
Leu Asn Trp Thr Ser Ala Leu Asp Glu Val Phe Lys Lys Asn Arg Glu 
185 190 195 

GAA GAC CCT TCA TTA TTG TGG CAG GTT TTT GGC AGT GCC ACT GGC CTA 
Glu Asp Pro Ser Leu Leu Trp Gin Val Phe Gly Ser Ala Thr Gly Leu 
200 205 210 

GCT CGA TAT TAT CCA GCT TCA CCA TGG GTT GAT AAT AGT AGA ACT CCA 
Ala Arg Tyr Tyr Pro Ala Ser Pro Trp Val Asp Asn Ser Arg Thr Pro 
215 220 225 230 

AAT AAG ATT GAC CTT TAT GAT GTA CGC AGA AGA CCA TGG TAC ATC CAA 
Asn Lys He Asp Leu Tyr Asp Val Arg Arg Arg Pro Trp Tyr He Gin 
235 240 245 

GGA GCT GCA TCT CCT AAA GAC ATG CTT ATT CTG GTG GAT GTG AGT GGA 
Gly Ala Ala Ser Pro Lys Asp Met Leu He Leu Val Asp Val Ser Gly 
250 255 260 

AGT GTT AGT GGA TTG ACA CTT AAA CTG ATC CGA ACA TCT GTC TCC GAA 
Ser Val Ser Gly Leu Thr Leu Lys Leu He Arg Thr Ser Val Ser Glu 
265 270 275 

ATG TTA GAA ACC CTC TCA GAT GAT GAT TTC GTG AAT GTA GCT TCA TTT 
Met Leu Glu Thr Leu Ser Asp Asp Asp Phe Val Asn Val Ala Ser Phe 
280 285 290 

AAC AGC AAT GCT CAG GAT GTA AGC TGT TTT CAG CAC CTT GTC CAA GCA 
Asn Ser Asn Ala Gin Asp Val Ser Cys Phe Gin His Leu Val Gin Ala 
295 300 305 310 

AAT GTA AGA AAT AAA AAA GTG TTG AAA GAC GCG GTG AAT AAT ATC ACA 
Asn Val Arg Asn Lys Lys Val Leu Lys Asp Ala Val Asn Asn He Thr 
315 320 325 

GCC AAA GGA ATT ACA GAT TAT AAG AAG GGC TTT AGT TTT GCT TTT GAA 
Ala Lys Gly He Thr Asp Tyr Lys Lys Gly Phe Ser Phe Ala Phe Glu 
330 335 340 

CAG CTG CTT AAT TAT AAT GTT TCC AGA GCA AAC TGC AAT AAG ATT ATT 
Gin Leu Leu Asn Tyr Asn Val Ser Arg Ala Asn Cys Asn Lys He He 
345 350 355 

ATG CTA TTC ACG GAT GGA GGA GAA GAG AGA GCC CAG GAG ATA TTT AAC 
Met Leu Phe Thr Asp Gly Gly Glu Glu Arg Ala Gin Glu He Phe Asn 
360 365 370 

AAA TAC AAT AAA GAT AAA AAA GTA CGT GTA TTC AGG TTT TCA GTT GGT 
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Lys Tyr Asn Lys Asp Lys Lys Val Arg Val Phe Arg Phe Ser Val Gly 
375 380 385 390 

CAA CAC AAT TAT GAG AGA GGA CCT ATT CAG TGG ATG GCC TGT GAA AAC 
Gin His Asn Tyr Glu Arg Gly Pro He Gin Trp Met Ala Cys Glu Asn 
395 400 405 

AAA GGT TAT TAT TAT GAA ATT CCT TCC ATT GGT GCA ATA AGA ATC AAT 
Lys Gly Tyr Tyr Tyr Glu He Pro Ser He Gly Ala He Arg He Asn 
410 415 420 

ACT CAG GAA TAT TTG GAT GTT TTG GGA AGA CCA ATG GTT TTA GCA GGA 
Thr Gin Glu Tyr Leu Asp Val Leu Gly Arg Pro Met Val Leu Ala Gly 
425 430 435 

GAC AAA GCT AAG CAA GTC CAA TGG ACA AAT GTG TAC CTG GAT GCA TTG 
Asp Lys Ala Lys Gin Val Gin Trp Thr Asn Val Tyr Leu Asp Ala Leu 
440 445 4 5 o 

GAA CTG GGA CTT GTC ATT ACT GGA ACT CTT CCG GTC TTC AAC ATA ACC 
Glu Leu Gly Leu Val He Thr Gly Thr Leu Pro Val Phe Asn He Thr 
455 460 465 470 

GGC CAA TTT GAA AAT AAG ACA AAC TTA AAG AAC CAG CTG ATT CTT GGT 
Gly Gin Phe Glu Asn Lys Thr Asn Leu Lys Asn Gin Leu He Leu Gly 
475 480 485 

GTG ATG GGA GTA GAT GTG TCT TTG GAA GAT ATT AAA AGA CTG ACA CCA 
Val Met Gly Val Asp Val Ser Leu Glu Asp He Lys Arg Leu Thr Pro 
490 495 500 

CGT TTT ACA CTG TGC CCC AAT GGG TAT TAC TTT GCA ATC GAT CCT AAT 
Arg Phe Thr Leu Cys Pro Asn Gly Tyr Tyr Phe Ala He Asp Pro Asn 
505 510 515 

GGT TAT GTT TTA TTA CAT CCA AAT CTT CAG CCA AAG AAC CCC AAA TCT 
Gly Tyr Val Leu Leu His Pro Asn Leu Gin Pro Lys Asn Pro Lys Ser 
520 525 530 

CAG GAG CCA GTA ACA TTG GAT TTC CTT GAT GCA GAG TTA GAG AAT GAT 
Gin Glu Pro Val Thr Leu Asp Phe Leu Asp Ala Glu Leu Glu Asn Asp 
535 540 545 550 

ATT AAA GTG GAG ATT CGA AAT AAG ATG ATT GAT GGG GAA AGT GGA GAA 
He Lys Val Glu He Arg Asn Lys Met He Asp Gly Glu Ser Gly Glu 
555 560 565 

AAA ACA TTC AGA ACT CTG GTT AAA TCT CAA GAT GAG AGA TAT ATT GAC 
Lys Thr Phe Arg Thr Leu Val Lys Ser Gin Asp Glu Arg Tyr He Asp 
570 575 580 

AAA GGA AAC AGG ACA TAC ACA TGG ACA CCT GTC AAT GGC ACA GAT TAC 
Lys Gly Asn Arg Thr Tyr Thr Trp Thr Pro Val Asn Gly Thr Asp Tyr 
585 590 595 

AGT TTG GCC TTG GTA TTA CCA ACC TAC AGT TTT TAC TAT ATA AAA GCC 
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Ser Leu Ala Leu Val Leu Pro Thr Tyr Ser Phe Tyr Tyr lie Lys Ala 
600 605 eio 

AAA CTA GAA GAG ACA ATA ACT CAG GCC AGA TAT TCG GAA ACC CTG AAG 
Lys Leu Glu Glu Thr He Thr Gin Ala Arg Tyr Ser Glu Thr Leu Lys 
615 620 625 630 

CCA GAT AAT TTT GAA GAA TCT GGC TAT ACA TTC ATA GCA CCA AGA GAT 
Pro Asp Asn Phe Glu Glu Ser Gly Tyr Thr Phe He Ala Pro Arg Asp 
635 640 645 

TAC TGC AAT GAC CTG AAA ATA TCG GAT AAT AAC ACT GAA TTT CTT TTA 
Tyr Cys Asn Asp Leu Lys He Ser Asp Asn Asn Thr Glu Phe Leu Leu 
650 655 660 

AAT TTC AAC GAG TTT ATT GAT AGA AAA ACT CCA AAC AAC CCA TCA TGT 
Asn Phe Asn Glu Phe He Asp Arg Lys Thr Pro Asn Asn Pro Ser Cys 
665 670 675 

AAC GCG GAT TTG ATT AAT AGA GTC TTG CTT GAT GCA GGC TTT ACA AAT 
Asn Ala Asp Leu He Asn Arg Val Leu Leu Asp Ala Gly Phe Thr Asn 
680 685 690 

GAA CTT GTC CAA AAT TAC TGG AGT AAG CAG AAA AAT ATC AAG GGA GTG 
Glu Leu Val Gin Asn Tyr Trp Ser Lys Gin Lys Asn He Lys Gly Val 
695 700 705 710 

AAA GCA CGA TTT GTT GTG ACT GAT GGT GGG ATT ACC AGA GTT TAT CCC 
Lys Ala Arg Phe Val Val Thr Asp Gly Gly He Thr Arg Val Tyr Pro 
715 720 725 

AAA GAG GCT GGA GAA AAT TGG CAA GAA AAC CCA GAG ACA TAT GAG GAC 
Lys Glu Ala Gly Glu Asn Trp Gin Glu Asn Pro Glu Thr Tyr Glu Asp 
730 735 740 

AGC TTC TAT AAA AGG AGC CTA GAT AAT GAT AAC TAT GTT TTC ACT GCT 
Ser Phe Tyr Lys Arg Ser Leu Asp Asn Asp Asn Tyr Val Phe Thr Ala 
745 750 755 

CCC TAC TTT AAC AAA AGT GGA CCT GGT GCC TAT GAA TCG GGC ATT ATG 
Pro Tyr Phe Asn Lys Ser Gly Pro Gly Ala Tyr Glu Ser Gly He Met 
760 765 770 

GTA AGC AAA GCT GTA GAA ATA TAT ATT CAA GGG AAA CTT CTT AAA CCT 
Val Ser Lys Ala Val Glu He Tyr He Gin Gly Lys Leu Leu Lys Pro 
775 780 785 790 

GCA GTT GTT GGA ATT AAA ATT GAT GTA AAT TCC TGG ATA GAG AAT TTC 
Ala Val Val Gly He Lys He Asp Val Asn Ser Trp He Glu Asn Phe 
795 800 805 

ACC AAA ACC TCA ATC AGA GAT CCG TGT GCT GGT CCA GTT TGT GAC TGC 
Thr Lys Thr Ser He Arg Asp Pro Cys Ala Gly Pro Val Cys Asp Cys 
810 815 B20 

AAA AGA AAC AGT GAC GTA ATG GAT TGT GTG ATT CTG GAT GAT GGT GGG 
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Lys Arg Asn Ser Asp Val Met Asp Cys Val lie Leu Asp Asp Gly Gly 
825 830 835 

TTT CTT CTG ATG GCA AAT CAT GAT GAT TAT ACT AAT CAG ATT GGA AGA 
Phe Leu Leu Met Ala Asn His Asp Asp Tyr Thr Asn Gin lie Gly Ara 
840 845 850 

TTT TTT GGA GAG ATT GAT CCC AGC TTG ATG AGA CAC CTG GTT AAT ATA 
Phe Phe Gly Glu He Asp Pro Ser Leu Met Arg His Leu Val Asn He 
855 860 865 870 

TCA GTT TAT GCT TTT AAC AAA TCT TAT GAT TAT CAG TCA GTA TGT GAG 
Ser Val Tyr Ala Phe Asn Lys Ser Tyr Asp Tyr Gin Ser Val Cys Glu 
875 880 885 

CCC GGT GCT GCA CCA AAA CAA GGA GCA GGA CAT CGC TCA GCA TAT GTG 
Pro Gly Ala Ala Pro Lys Gin Gly Ala Gly His Arg Ser Ala Tyr Val 
890 895 900 

CCA TCA GTA GCA GAC ATA TTA CAA ATT GGC TGG TGG GCC ACT GCT GCT 
Pro Ser Val Ala Asp He Leu Gin He Gly Trp Trp Ala Thr Ala Ala 
905 910 915 

GCC TGG TCT ATT CTA CAG CAG TTT CTC TTG AGT TTG ACC TTT CCA CGA 
Ala Trp Ser He Leu Gin Gin Phe Leu Leu Ser Leu Thr Phe Pro Ara 
920 925 930 

CTC CTT GAG GCA GTT GAG ATG GAG GAT GAT GAC TTC ACG GCC TCC CTG 
Leu Leu Glu Ala Val Glu Met Glu Asp Asp Asp Phe Thr Ala Ser Leu 
935 940 945 950 

TCC AAG CAG AGC TGC ATT ACT GAR CAA ACC CAG TAT TTC TTC GAT AAC 
Ser Lys Gin Ser Cys He Thr Glu Gin Thr Gin Tyr Phe Phe Asp Asn 
955 960 965 

GAC AGT AAA TCA TTC AGT GGT GTA TTA GAC TGT GGA AAC TGT TCC AGA 
Asp Ser Lys Ser Phe Ser Gly Val Leu Asp Cys Gly Asn Cys Ser Arg 
970 975 980 

ATC TTT CAT GGA GAA AAG CTT ATG AAC ACC AAC TTA ATA TTC ATA ATG 
He Phe His Gly Glu Lys Leu Met Asn Thr Asn Leu He Phe He Met 
985 990 995 

GTT GAG AGC AAA GGG ACA TGT CCA TGT GAC ACA CGA CTG CTC ATA CAA 
Val Glu Ser Lys Gly Thr Cys Pro Cys Asp Thr Arg Leu Leu He Gin 
1000 1005 1010 

GCG GAG CAG ACT TCT GAC GGT CCA AAT CCT TGT GAC ATG GTT AAG CAA 
Ala Glu Gin Thr Ser Asp Gly Pro Asn Pro Cys Asp Met Val Lys Gin 
1C) 15 1020 1025 1030 

CCT AGA TAC CGA AAA GGG CCT GAT GTC TGC TTT GAT AAC AAT GTC TTG 
Pro Arg Tyr Arg Lys Gly Pro Asp Val Cys Phe Asp Asn Asn Val Leu 
1035 1040 1045 

GAG GAT TAT ACT GAC TGT GGT GGT GTT TCT GGA TTA AAT CCC TCC CTG 
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Glu Asp Tyr Thr Asp Cys Gly Gly Val Ser Gly Leu Asn Pro Ser Leu 
1050 1055 1060 

TGG TAT ATC ATT GGA ATC CAG TTT CTA CTA CTT TGG CTG GTA TCT GGC 3268 
Trp Tyr lie lie Gly lie Gin Phe Leu Leu Leu Trp Leu Val Ser Gly 
1065 1070 1075 

AGC ACA CAC CGG CTG TTA TGACCTTCTA AAAACCAAAT CTG C ATAGTT 3316 
Ser Thr His Arg Leu Leu 

1080 108 

AAACTCCAGA CCCTGCCAAA ACATGAGCCC TGCCCTCAAT TACAGTAACG TAGGGTCAGC 3376 

TATAAAATCA GACAAACATT AGCTGGGCCT GTTCCATGGC ATAACACTAA GGCGCAGACT 3436 

CCTAAGGCAC CCACTGGCTG CATGTCAGGG TGTCAGATCC TTAAACGTGT GTGAATGCTG 34 96 

CATCATCTAT GTGTAACATC AAAGCAAAAT CCTATACGTG TCCTCTATTG GAAAATTTGG 3556 

GCGTTTGTTG TTGCATTGTT GGT 3579 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1681 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1437 

(D) OTHER INFORMATION: /standard_name= "Beta-1- 

(ix) FEATURE: 

(A) NAME/KEY: 3'UTR 

(B) LOCATION: 1435.. 1681 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

ATG GTC CAG AAG ACC AGC ATG TCC CGG GGC CCT TAC CCA CCC TCC CAG 
Met Val Gin Lys Thr Ser Met Ser Arg Gly Pro Tyr Pro Pro Ser Gin 
15 10 15 

GAG ATC CCC ATG GAG GTC TTC GAC CCC AGC CCG CAG GGC AAA TAC AGC 
Glu He Pro Met Glu Val Phe Asp Pro Ser Pro Gin Gly Lys Tyr Ser 
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ACC ACA TCC AAC AGC TTT GTC CGC CAG GGC TCA GCG GAG TCC TAC ACC 
Thr Thr Ser Asn Ser Phe Val Arg Gin Gly Ser Ala Glu Ser Tyr Thr 
50 55 60 

AGC CGT CCA TCA GAC TCT GAT GTA TCT CTG GAG GAG GAC CGG GAA GCC 
Ser Arg Pro Ser Asp Ser Asp Val Ser Leu Glu Glu Asp Arg Glu Ala 
65 70 75 ' 80 

TTA AGG AAG GAA GCA GAG CGC CAG GCA TTA GCG CAG CTC GAG AAG GCC 
Leu Arg Lys Glu Ala Glu Arg Gin Ala Leu Ala Gin Leu Glu Lys Ala 
85 go 95 

AAG ACC AAG CCA GTG GCA TTT GCT GTG CGG ACA AAT GTT GGC TAC AAT 
Lys Thr Lys Pro Val Ala Phe Ala Val Arg Thr Asn Val Gly Tyr Asn 
10 ° 105 no 

CCG TCT CCA GGG GAT GAG GTG CCT GTG CAG GGA GTG GCC ATC ACC TTC 
Pro Ser Pro Gly Asp Glu Val Pro Val Gin Gly Val Ala He Thr Phe 
US 120 125 

GAG CCC AAA GAC TTC CTG CAC ATC AAG GAG AAA TAC AAT AAT GAC TGG 
Glu Pro Lys Asp Phe Leu His He Lys Glu Lys Tyr Asn Asn Asp Trp 
130 135 140 

TGG ATC GGG CGG CTG GTG AAG GAG GGC TGT GAG GTT GGC TTC ATT CCC 
Trp He Gly Arg Leu Val Lys Glu Gly Cys Glu Val Gly Phe He Pro 
145 150 155 * 1S0 

AGC CCC GTC AAA CTG GAC AGC CTT CGC CTG CTG CAG GAA CAG AAG CTG 
ser Pro Val Lys Leu Asp Ser Leu Arg Leu Leu Gin Glu Gin Lys Leu 
165 170 175 

CGC CAG AAC CGC CTC GGC TCC AGC AAA TCA GGC GAT AAC TCC AGT TCC 
Arg Gin Asn Arg Leu Gly Ser Ser Lys Ser Gly Asp Asn Ser Ser Ser 
180 185 190 

AGT CTG GGA GAT GTG GTG ACT GGC ACC CGC CGC CCC ACA CCC CCT GCC 
Ser Leu Gly Asp Val Val Thr Gly Thr Arg Arg Pro Thr Pro Pro Ala 
195 200 205 

AGT GGT AAT GAA ATG ACT AAC TTA GCC TTT GAA CTA GAC CCC CTA GAG 
Ser Gly Asn Glu Met Thr Asn Leu Ala Phe Glu Leu Asp Pro Leu Glu 
210 215 220 

TTA GAG GAG GAA GAG GCT GAG CTT GGT GAG CAG AGT GGC TCT GCC AAG 
Leu Glu Glu Glu Glu Ala Glu Leu Gly Glu Gin Ser Gly Ser Ala Lys 
225 230 235 240 

ACT AGT GTT AGC AGT GTC ACC ACC CCG CCA CCC CAT GGC AAA CGC ATC 
Thr Ser Val Ser Ser Val Thr Thr Pro Pro Pro His Gly Lys Arg He 
245 250 255 

CCC TTC TTT AAG AAG ACA GAG CAT GTG CCC CCC TAT GAC GTG GTG CCT 
Pro Phe Phe Lys Lys Thr Glu His Val Pro Pro Tyr Asp Val Val Pro 
260 265 270 
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TCC ATG AGG CCC ATC ATC CTG GTG GGA CCG TCG CTC AAG GGC TAC GAG 864 
Ser Met Arg Pro He He Leu Val Gly Pro Ser Leu Lys Gly Tyr Glu 
275 280 285 

GTT ACA GAC ATG ATG CAG AAA GCT TTA TTT GAC TTC TTG AAG CAT CGG 912 
Val Thr Asp Met Met Gin Lys Ala Leu Phe Asp Phe Leu Lys His Arg 
290 295 300 

TTT GAT GGC AGG ATC TCC ATC ACT CGT GTG ACG GCA GAT ATT TCC CTG 960 
Phe Asp Gly Arg He Ser He Thr Arg Val Thr Ala Asp He Ser Leu 
305 310 315 320 

GCT AAG CGC TCA GTT CTC AAC AAC CCC AGC AAA CAC ATC ATC ATT GAG 1008 
Ala Lys Arg Ser Val Leu Asn Asn Pro Ser Lys His He He He Glu 
325 330 335 

CGC TCC AAC ACA CGC TCC AGC CTG GCT GAG GTG CAG AGT GAA ATC GAG 1056 
Arg Ser Asn Thr Arg Ser Ser Leu Ala Glu Val Gin Ser Glu He Glu 
340 345 350 

CGA ATC TTC GAG CTG GCC CGG ACC CTT CAG TTG GTC GCT CTG GAT GCT 1104 
Arg He Phe Glu Leu Ala Arg Thr Leu Gin Leu Val Ala Leu Asp Ala 
355 360 365 

GAC ACC ATC AAT CAC CCA GCC CAG CTG TCC AAG ACC TCG CTG GCC CCC 1152 
Asp Thr He Asn His Pro Ala Gin Leu Ser Lys Thr Ser Leu Ala Pro 
370 375 380 

ATC ATT GTT TAC ATC AAG ATC ACC TCT CCC AAG GTA CTT CAA AGG CTC 1200 
He He Val Tyr He Lys He Thr Ser Pro Lys Val Leu Gin Arg Leu 
385 390 395 400 

ATC AAG TCC CGA GGA AAG TCT CAG TCC AAA CAC CTC AAT GTC CAA ATA 124 8 

He Lys Ser Arg Gly Lys Ser Gin Ser Lys His Leu Asn Val Gin He 
405 410 415 

GCG GCC TCG GAA AAG CTG GCA CAG TGC CCC CCT GAA ATG TTT GAC ATC 1296 
Ala Ala Ser Glu Lys Leu Ala Gin Cys Pro Pro Glu Met Phe Asp He 
420 425 430 

ATC CTG GAT GAG AAC CAA TTG GAG GAT GCC TGC GAG CAT CTG GCG GAG 1344 
He Leu Asp Glu Asn Gin Leu Glu Asp Ala Cys Glu His Leu Ala Glu 
435 440 445 

TAC TTG GAA GCC TAT TGG AAG GCC ACA CAC CCG CCC AGC AGC ACG CCA 13 92 

Tyr Leu Glu Ala Tyr Trp Lys Ala Thr His Pro Pro Ser Ser Thr Pro 
450 455 460 

CCC AAT CCG CTG CTG AAC CGC ACC ATG GCT ACC GCA GCC CTG GCT 1437 
Pro Asn Pro Leu Leu Asn Arg Thr Met Ala Thr Ala Ala Leu Ala 
465 470 475 

GCCAGCCCTG CCCCTGTCTC CAACCTCCAG GTACAGGTGC TCACCTCGCT CAGGAGAAAC 14 97 
CTCGGCTTCT GGGGCGGGCT GGAGTCCTCA CAGCGGGGCA GTGTGGTGCC CCAGGAGCAG 1557 
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GAACATGCCA TGTAGTGGGC GCCCTGCCCG TCTTCCCTCC TGCTCTGGGG TCGGAACTGG 1617 
AGTGCAGGGA ACATGGAGGA GGAAGGGAAG AGCTTTATTT TGTAAAAAAA TAAGATGAGC 1677 
GGCA 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 1526 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..651 

(D) OTHER INFORMATION: /standard_name= "Beta- 1-4" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

ATG GTC CAG AAG ACC AGC ATG TCC CGG GGC CCT TAC CCA CCC TCC CAG 
Met Val Gin Lys Thr Ser Met Ser Arg Gly Pro Tyr Pro Pro Ser Gin 
1 5 10 15 

GAG ATC CCC ATG GAG GTC TTC GAC CCC AGC CCG CAG GGC AAA TAC AGC 
Glu lie Pro Met Glu Val Phe Asp Pro Ser Pro Gin Gly Lys Tyr Ser 
20 25 30 

AAG AGG AAA GGG CGA TTC AAA CGG TCA GAT GGG AGC ACG TCC TCG GAT 
Lys Arg Lys Gly Arg Phe Lys Arg Ser Asp Gly Ser Thr Ser Ser Asp 
35 40 45 

ACC ACA TCC AAC AGC TTT GTC CGC CAG GGC TCA GCG GAG TCC TAC ACC 
Thr Thr Ser Asn Ser Phe Val Arg Gin Gly Ser Ala Glu Ser Tyr Thr 
50 55 60 

AGC CGT CCA TCA GAC TCT GAT GTA TCT CTG GAG GAG GAC CGG GAA GCC 
Ser Arg Pro Ser Asp Ser Asp Val Ser Leu Glu Glu Asp Arg Glu Ala 
65 70 75 " 80 

TTA AGG AAG GAA GCA GAG CGC CAG GCA TTA GCG CAG CTC GAG AAG GCC 
Leu Arg Lys Glu Ala Glu Arg Gin Ala Leu Ala Gin Leu Glu Lys Ala 
85 90 95 

AAG ACC AAG CCA GTG GCA TTT GCT GTG CGG ACA AAT GTT GGC TAC AAT 
Lys Thr Lys Pro Val Ala Phe Ala Val Arg Thr Asn Val Gly Tyr Asn 
100 105 110 

CCG TCT CCA GGG GAT GAG GTG CCT GTG CAG GGA GTG GCC ATC ACC TTC 
Pro Ser Pro Gly Asp Glu Val Pro Val Gin Gly Val Ala He Thr Phe 
115 120 125 
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GAG CCC AAA GAC TTC CTG CAC ATC AAG GAG AAA TAC AAT AAT GAC TGG 432 
Glu Pro Lys Asp Phe Leu His He Lys Glu Lys Tyr Asn Asn Asp Trp 
130 135 140 

•TGG ATC GGG CGG CTG GTG AAG GAG GGC TGT GAG GTT GGC TTC ATT CCC 480 
Trp He Gly Arg Leu, Val Lys Glu Gly Cys Glu Val Gly Phe He Pro 
145 150 155 160 

AGC CCC GTC AAA CTG GAC AGC CTT CGC CTG CTG CAG GAA CAG AAG CTG 528 
Ser Pro Val Lys Leu Asp Ser Leu Arg Leu Leu Gin Glu Gin Lys Leu 
165 170 175 

CGC CAG AAC CGC CTC GGC TCC AGC AAA TCA GGC GAT AAC TCC AGT TCC 576 
Arg Gin Asn Arg Leu Gly Ser Ser Lys Ser Gly Asp Asn Ser Ser Ser 
180 185 190 

AGT CTG GGA GAT GTG GTG ACT GGC ACC CGC CGC CCC ACA CCC CCT GCC 624 
Ser Leu Gly Asp Val Val Thr Gly Thr Arg Arg Pro Thr Pro Pro Ala 
195 200 205 



AGT GAC AGA GCA TGT GCC CCC CTA TGACGTGGTG CCTTCCATGA GGCCCATCAT 678 
Ser Asp Arg Ala Cys Ala Pro Leu 
210 215 



CCTGGTGGGA 


CCGTCGCTCA 


AGGG CTACG A 


GGTTACAGAC 


ATGATGCAGA AAGCTTTATT 


738 


TGACTTCTTG 


AAGCATCGGT 


TTGATGGCAG 


GATCTCCATC 


ACTCGTGTGA 


CGGCAGATAT 


798 


TTCCCTGGCT 


AAGCGCTCAG 


TTCTCAACAA 


CCCCAGCAAA 


CACATCATCA 


TTGAGCGCTC 


858 


CAACACACGC 


TCCAGCCTGG 


CTGAGGTGCA 


GAGTGAAATC 


GAGCGAATCT 


TCGAGCTGGC 


916 


CCGGACCCTT 


CAGTTGGTCG 


CTCTGGATGC 


TGACACCATC 


AATCACCCAG 


CCCAGCTGTC 


978 


CAAGACCTCG 


CTGGCCCCCA 


TCATTGTTTA 


CATCAAGATC 


ACCTCTCCCA 


AGGTACTTCA 


1038 


AAGGCTCATC 


AAGTCCCGAG 


GAAAGTCTCA 


GTCCAAACAC 


CTCAATGTCC 


AAATAGCGGC 


1098 


CTCGGAAAAG 


CTGGCACAGT 


GCCCCCCTGA 


AATGTTTGAC 


ATCATCCTGG 


ATGAGAACCA 


1158 


ATTGGAGGAT 


GCCTGCGAGC 


ATCTGGCGGA 


GTACTTGGAA 


GCCTATTGGA 


AGGCCACACA 


1218 


CCCGCCCAGC 


AGCACGCCAC 


CCAATCCGCT 


GCTGAACCGC 


ACCATGGCTA 


CCGCAGCCCT 


1278 


GGCTGCCAGC 


CCTGCCCCTG 


TCTCCAACCT 


CCAGGTACAG 


GTGCTCACCT 


CGCTCAGGAG 


1338 


AAACCTCGGC 


TTCTGGGGCG 


GGCTGGAGTC 


CTCACAGCGG 


GGCAGTGTGG 


TGCCCCAGGA 


1398 


GCAGGAACAT 


GCCATGTAGT 


GGGCGCCCTG 




CTCCTGCTCT 


GGGGT CGGAA 


1458 


CTGGAGTGCA 


GGGAACATGG 


AGGAGGAAGG 


GAAGAGCTTT 


ATTTTGTAAA AAAATAAGAT 


1518 


GAG CGG C A 












1526 



(2) INFORMATION FOR SEQ ID NO: 35: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1393 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1. .660 

(D) OTHER INFORMATION: /standard_name= »Beta-l-5 •' 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:35: 

mI? S*? ^ G f* 0 ACC AGC ATG TCC CGG GGG CCT TAC CCA CCC TCC CAG 
Met Val Gin Lys Thr Ser Met Ser Arg Gly Pro Tyr Pro Pro Ser Sn 

5 10 15 

GAG ATC CCC ATG GAG GTC TTC GAC CCC AGC CCG CAG GGC AAA TAC AGC 
Glu lie Pro Met Glu Val Phe Asp Pro Ser Pro Gin Gly LyJ Tyf £er 
20 25 30 

ii2 A*™ 2?° CGA TTC *** CGG TCA GAT GGG AG ^ ACQ TCC TCG GAT 

Lys Arg Lys Gly Arg Phe Lys Arg Ser Asp Gly Ser Thr Ser Ser Asp 
J3 40 45 

ACC ACA TCC AAC AGC TTT GTC CGC CAG GGC TCA GCG GAG TCC TAC ACC 
Thr Thr ser Asn Ser Phe Val Arg Gin Gly Ser Ala Glu Ser Jyf ?Sr 

S2r SI c CA GAC o CT GAT GTA TCT CTG GAG GAG GAC C< 3G GAA GCC 

Ser Arg Pro Ser Asp Ser Asp Val Ser Leu Glu Glu Asp Arg Glu Ala 

£J £a SJS rt^ n? A GAG CGC £?° GCA TOA GCG CAG CTC GAG AAG GCC 
Leu Arg Lys Glu Ala Glu Arg Gin Ala Leu Ala Gin Leu Glu Lys Ala 
85 90 95 

AAG ACC AAG CCA GTG GCA TTT GCT GTG CGG ACA AAT GTT GGC TAC AAT 
Lys Thr Lys Pro Val Ala Phe Ala Val Arg Thr Asn Val Gly Tyr Asn 
100 105 no 

CCG TCT CCA GGG GAT GAG GTG CCT GTG CAG GGA GTG GCC ATC ACC TTC 
Pro Ser Pro Gly Asp Glu Val Pro Val Gin Gly Val Ala He Thr Phe 
115 120 125 

GAG CCC AAA GAC TTC CTG CAC ATC AAG GAG AAA TAC AAT AAT GAC TGG 
Glu Pro Lys Asp Phe Leu His He Lys Glu Lys Tyr Asn Asn Asp Trp 
130 135 140 

TGG ATC GGG CGG CTG GTG AAG GAG GGC TGT GAG GTT GGC TTC ATT CCC 
Trp He Gly Arg Leu Val Lys Glu Gly Cys Glu Val Gly Phe He Pro 
145 150 155 leo 
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AGC CCC GTC AAA CTG GAC AGC CTT CGC CTG CTG CAG GAA CAG AAG CTG 528 
Ser Pro Val Lys Leu Asp Ser Leu Arg Leu Leu Gin Glu Gin Lys Leu 
165 170 175 

CGC CAG AAC CGC CTC GGC TCC AGC AAA TCA GGC GAT AAC TCC AGT TCC 576 
Arg Gin Asn Arg Leu Gly Ser Ser Lys Ser Gly Asp Asn Ser Ser Ser 
180 " 185 19° 

AGT CTG GGA GAT GTG GTG ACT GGC ACC CGC CGC CCC ACA CCC CCT GCC 624 
Ser Leu Gly Asp Val Val Thr Gly Thr Arg Arg Pro Thr Pro Pro Ala 
195 200 205 

AGT GGT TAC AGA CAT GAT GCA GAA AGC TTT ATT TGACTTCTTG AAGCATCGGT 677 
Ser Gly Tyr Arg His Asp Ala Glu Ser Phe He 

210 215 220 

TTGATGGCAG GATCTCCATC ACTCGTGTGA CGGCAGATAT TTCCCTGGCT AAGCGCTCAG 73 7 

TTCTCAACAA CCCCAGCAAA CACATCATCA TTGAGCGCTC CAACACACGC TCCAGCCTGG 797 

CTGAGGTGCA GAGTGAAATC GAGCGAATCT TCGAGCTGGC CCGGACCCTT CAGTTGGTCG 857 

CTCTGGATGC TGACACCATC AATCACCCAG CCCAGCTGTC CAAGACCTCG CTGGCCCCCA 917 

TCATTGTTTA CATCAAGATC ACCTCTCCCA AGGTACTTCA AAGGCTCATC AAGTCCCGAG 977 

GAAAGTCTCA GTCCAAACAC CTCAATGTCC AAATAGCGGC CTCGGAAAAG CTGGCACAGT 1037 

GCCCCCCTGA AATGTTTGAC ATCATCCTGG ATGAGAACCA ATTGGAGGAT GCCTGCGAGC 1097 

ATCTGGCGGA GTACTTGGAA GCCTATTGGA AGGCCACACA CCCGCCCAGC AGCACGCCAC 1157 

CCAATCCGCT GCTGAACCGC ACCATGGCTA CCGCAGCCCT GGCTGCCAGC CCTGCCCCTG 1217 

TCTCCAACCT CCAGGTACAG GTGCTCACCT CGCTCAGGAG AAACCTCGGC TTCTGGGGCG 12 77 

GGCTGGAGTC CTCACAGCGG GGCAGTGTGG TGCCCCAGGA GCAGGAACAT GCCATGTAGT 1337 

GGGCGCCCTG CCCGTCTTCC CTCCTGCTCT GGGGTCGGAA CTGGAGTGCA GGGAAC 13 93 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6725 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 226.. 6642 „*■,„>,. 1P ,» 
(D) OTHER INFORMATION: /standard_name= "Alpna-l<_-<2 
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180 
234 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

CTCGAGGAGG CAGTAGTGGA AAGGAGCAGT TTTTGGGGTT TGATGCCATA ATGGGAATCA 

GGTAATCGTC GGCGGGGAAG AAGAAACGCT GCAGACCACG GCTTCCTCGA ATCTTGCGCG 

AAAGCCGCCG GCCTCGGAGG AGGGATTAAT CCAGACCCGC CGGGGGGTGT TTTCACATTT 

CTTCCTCTTC GTGGCTGCTC CTCCTATTAA AACCATTTTT GGTCC ATG GTC AAT 

Met Val Asn 
1 

rlZ ^3 £u G AGG ATG TAC ATT CCA GAG GAA AAC CAC CAA GGT TCC AAC 282 
Glu Asn Thr Arg Met Tyr He Pro Glu Glu Asn His Gin Gly Ser Asn 
5 10 is 

™l GGG AGC CCA C GC CCC GCC CAT GCC AAC ATG AAT GCC AAT GCG GCA 33 0 

Tyr Gly Ser Pro Arg Pro Ala His Ala Asn Met Asn Ala Asn Ala Ala 
25 30 35 

GCG GGG CTG GCC CCT GAG CAC ATC CCC ACC CCG GGG GCT GCC CTG TCG 3 78 

Ala Gly Leu Ala Pro Glu His He Pro Thr Pro Gly Ala Ala Leu Ser 
40 45 so 

£2 G^n S?2 f?f tT C GAC GCA G f° CGG ^ GCT **° CTG ATG GGC AGC 426 
Trp Gin Ala Ala He Asp Ala Ala Arg Gin Ala Lys Leu Met Gly Ser 
55 60 65 

21 r?S GCG A ? C TCC ACA GTC AGC TCC ACG »° GGG AAG CGG 474 

Ala Gly Asn Ala Thr He Ser Thr Val Ser Ser Thr Gin Arg Lys Arg 
70 75 80 

Sfn Gin *** CCC ^ ^ ^ GGC AGC ACC ACG GCC AGA CGC 522 

Gin Gin Tyr Gly Lys Pro Lys Lys Gin Gly Ser Thr Thr Ala Thr Arq 
85 go 95 

Pro P^o GCC x CTG CTC TGC CTG ACC CTG **» ^ CCC A TC CGG AGG 570 

Pro Pro Arg Ala Leu Leu Cys Leu Thr Leu Lys Asn Pro He Arg Arg 
100 105 HO us 

GCC TGC ATC AGC ATT GTC GAA TGG AAA CCA TTT GAA ATA ATT ATT TTA 618 
Ala Cys He Ser He Val Glu Trp Lys Pro Phe Glu He He He Leu 
12° 125 130 

CTG ACT ATT TTT GCC AAT TGT GTG GCC TTA GCG ATC TAT ATT CCC TTT 666 
Leu Thr He Phe Ala Asn Cys Val Ala Leu Ala He Tyr He Pro Phe 
135 140 145 

CCA GAA GAT GAT TCC AAC GCC ACC AAT TCC AAC CTG GAA CGA GTG GAA 714 
Pro Glu Asp Asp Ser Asn Ala Thr Asn Ser Asn Leu Glu Arg Val Glu 
150 155 160 
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GCC TAT GGA CTC CTC TTT CAC CCC AAT GCC TAC CTC CGC AAC GGC TGG 
Ala Tyr Gly Leu Leu Phe His Pro Asn Ala Tyr Leu Arg Asn Gly Trp 
180 185 190 195 

AAC CTA CTA GAT TTT ATA ATT GTG GTT GTG GGG CTT TTT AGT GCA ATT 
Asn Leu Leu Asp Phe He He Val Val Val Gly Leu Phe Ser Ala He 
200 205 210 

TTA GAA CAA GCA ACC AAA GCA GAT GGG GCA AAC GCT CTC GGA GGG AAA 
Leu Glu Gin Ala Thr Lys Ala Asp Gly Ala Asn Ala Leu Gly Gly Lys 
215 220 225 

GGG GCC GGA TTT GAT GTG AAG GCG CTG AGG GCC TTC CGC GTG CTG CGC 
Gly Ala Gly Phe Asp Val Lys Ala Leu Arg Ala Phe Arg Val Leu Arg 
230 " 235 240 

CCC CTG CGG CTG GTG TCC GGA GTC CCA AGT CTC CAG GTG GTC CTG AAT 
Pro Leu Arg Leu Val Ser Gly Val Pro Ser Leu Gin Val Val Leu Asn 
245 250 255 

TCC ATC ATC AAG GCC ATG GTC CCC CTG CTG CAC ATC GCC CTG CTT GTG 
Ser He He Lys Ala Met val Pro Leu Leu His He Ala Leu Leu Val 
260 265 270 275 

CTG TTT GTC ATC ATC ATC TAC GCC ATC ATC GGC TTG GAG CTC TTC ATG 
Leu Phe Val He He He Tyr Ala He He Gly Leu Glu Leu Phe Met 
280 " 285 290 

GGG AAG ATG CAC AAG ACC TGC TAC AAC CAG GAG GGC ATA GCA GAT GTT 
Gly Lys Met His Lys Thr Cys Tyr Asn Gin Glu Gly He Ala Asp Val 
295 300 305 

CCA GCA GAA GAT GAC CCT TCC CCT TGT GCG CTG GAA ACG GGC CAC GGG 
Pro Ala Glu Asp Asp Pro Ser Pro Cys Ala Leu Glu Thr Gly His Gly 
310 315 320 

CGG CAG TGC CAG AAC GGC ACG GTG TGC AAG CCC GGC TGG GAT GGT CCC 
Arg Gin Cys Gin Asn Gly Thr Val Cys Lys Pro Gly Trp Asp Gly Pro 
325 330 335 

AAG CAC GGC ATC ACC AAC TTT GAC AAC TTT GCC TTC GCC ATG CTC ACG 
Lys His Gly He Thr Asn Phe Asp Asn Phe Ala Phe Ala Met Leu Thr 
340 345 350 355 

GTG TTC CAG TGC ATC ACC ATG GAG GGC TGG ACG GAC GTG CTG TAC TGG 
Val Phe Gin Cys He Thr Met Glu Gly Trp Thr Asp Val Leu Tyr Trp 
360 365 370 

GTC AAT GAT GCC GTA GGA AGG GAC TGG CCC TGG ATC TAT TTT GTT ACA 
Val Asn Asp Ala Val Gly Arg Asp Trp Pro Trp He Tyr Phe Val Thr 
375 380 385 

CTA ATC ATC ATA GGG TCA TTT TTT GTA CTT AAC TTG GTT CTC GGT GTG 
Leu He He He Gly Ser Phe Phe Val Leu Asn Leu Val Leu Gly Val 
390 395 400 
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Leu Ser S?> *** GAG AGG GAG ^ GCC GCC GGA 

Ser Gly Glu Phe Ser Lys Glu Arg Glu Lys Ala Lys Ala Arg Gly 
410 415 * 

a — ££ C J£° CTG CGG GAG CAG GAG CTA GAA GAG GAT CTC AAA 

Asp Phe Gin Lys Leu Arg Glu Lys Gin Gin Leu Glu Glu Asp Leu Lyt 
425 430 435 

S? C If 0 t C TG GAT TGG ATC ACT CAG GCC GAA GAC ATC GAT CCT GAG AAT 
Gly Tyr Leu Asp Trp H e Thr Gin Ala Glu Asp lie As? p" S2 A^n 
440 445 450 

GAC GAA S? C ATG GAT GAG GAG ccc CGA AAC ATG AGC ATG CCC 

Glu Asp Glu Gly Met Asp Glu Glu Lys Pro Arg Asn Met Ser Met Pro 
455 460 465 

ihr stl r AG ACC GAG TCC GTC ^ ACC GAA GTG GCT GGA GGT GAC 

Thr ser Glu Thr Glu Ser Val Asn Thr Glu Asn Val Ala Gly Gly Asp 

ATC GAG GGA GAA AAC TGC GGG GCC AGG CTG GCC CAC CGG ATC TCC AAG 
He Glu Gly Glu Asn Cys Gly Ala Arg Leu Ala His i£g S sS £s 
* a:> 490 495 

TCA AAG TTC AGC CGC TAC TGG CGC CGG TGG AAT CGG TTC TGC AGA AGG 
Ser Lys Phe Ser Arg Tyr Trp Arg Arg Trp Asn Sg £S lys J£ J£ 

£y"s £S A^o S C f ? CA v TC TCT ** T GTC TOC TAC TGG CTG GTG ATT 

Lys Cys Arg Ala Ala Val Lys Ser Asn Val Phe Tyr Trp Leu Val He 

520 525 530 

TTC CTG GTG TTC CTC AAC ACG CTC ACC ATT GCC TCT GAG CAC TAC AAC 
Phe Leu val Phe Leu Asn Thr Leu Thr He Ala Ser Glu His Tyr Jin 
535 540 545 

Gin P^ C AAC l GG T CTC ACA GAA GTC GAC ACG GCA AAC AAG GCC CTG 

Gin Pro Asn Trp Leu Thr Glu Val Gin Asp Thr Ala Asn Lys Ala Leu 
550 555 5 S0 

Su A C ° t C I G ll C GCA GAG ATG CTC CTG *"* ATG TAC A °C CTG GGC 

Leu Ala Leu Phe Thr Ala Glu Met Leu Leu Lys Met Tyr Ser Leu Glv 
565 570 575 

CTG CAG GCC TAC TTC GTG TCC CTC TTC AAC CGC TTT GAC TGC TTC GTC 
Leu Gin Ala Tyr Phe Val Ser Leu Phe Asn Arg Phe Asp Cys Phe Val 
580 585 590 595 

SI? Ill riZ A T° T CTG S AG ACC ATC CTG GTG GAG ACC A** ATC ATG 

Val Cys Gly Gly He Leu Glu Thr He Leu Val Glu Thr Lys He Met 
600 605 610 

TCC CCA CTG GGC ATC TCC GTG CTC AGA TGC GTC CGG CTG CTG AGG ATT 
Ser Pro Leu Gly He Ser Val Leu Arg Cys Val Arg Leu Leu Arg 111 
615 620 625 
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TTC AAG ATC ACG AGG TAC TGG AAC TCC TTG AGC AAC CTG GTG GCA TCC 2154 
Phe Lys lie Thr Arg Tyr Trp Asn Ser Leu Ser Asn Leu Val Ala Ser 
630 635 640 

TTG CTG AAC TCT GTG CGC TCC ATC GCC TCC CTG CTC CTT CTC CTC TTC 2202 
Leu Leu Asn Ser Val Arg Ser He Ala Ser Leu Leu Leu Leu Leu Phe 
645 650 655 

CTC TTC ATC ATC ATC TTC TCC CTC CTG GGG ATG CAG CTC TTT GGA GGA 2250 
Leu Phe He He He Phe Ser Leu Leu Gly Met Gin Leu Phe Gly Gly 
660 665 670 675 

AAG TTC AAC TTT GAT GAG ATG CAG ACC CGG AGG AGC ACA TTC GAT AAC 2298 
Lys Phe Asn Phe Asp Glu Met Gin Thr Arg Arg Ser Thr Phe Asp Asn 
680 685 690 

TTC CCC CAG TCC CTC CTC ACT GTG TTT CAG ATC CTG ACC GGG GAG GAC 2346 
Phe Pro Gin Ser Leu Leu Thr Val Phe Gin He Leu Thr Gly Glu Asp 
695 700 705 

TGG AAT TCG GTG ATG TAT GAT GGG ATC ATG GCT TAT GGC GGC CCC TCT 2394 
Trp Asn Ser Val Met Tyr Asp Gly He Met Ala Tyr Gly Gly Pro Ser 
710 715 720 

TTT CCA GGG ATG TTA GTC TGT ATT TAC TTC ATC ATC CTC TTC ATC TGT 2442 
Phe Pro Gly Met Leu Val Cys He Tyr Phe He He Leu Phe He Cys 
725 730 735 

GGA AAC TAT ATC CTA CTG AAT GTG TTC TTG GCC ATT GCT GTG GAC AAC 24 90 

Gly Asn Tyr He Leu Leu Asn Val Phe Leu Ala lie Ala Val Asp Asn 
740 ' 745 750 755 

CTG GCT GAT GCT GAG AGC CTC ACA TCT GCC CAA AAG GAG GAG GAA GAG 253 8 

Leu Ala Asp Ala Glu Ser Leu Thr Ser Ala Gin Lys Glu Glu Glu Glu 
760 765 770 

GAG AAG GAG AGA AAG AAG CTG GCC AGG ACT GCC AGC CCA GAG AAG AAA 2586 
Glu Lys Glu Arg Lys Lys Leu Ala Arg Thr Ala Ser Pro Glu Lys Lys 
775 780 785 

CAR GAG TTG GTG GAG AAG CCG GCA GTG GGG GAA TCC AAG GAG GAG AAG 2634 
Gin Glu Leu Val Glu Lys Pro Ala Val Gly Glu Ser Lys Glu Glu Lys 
790 795 800 

ATT GAG CTG AAA TCC ATC ACG GCT GAC GGA GAG TCT CCA CCC GCC ACC 2682 
He Glu Leu Lys Ser He Thr Ala Asp Gly Glu Ser Pro Pro Ala Thr 
805 * 810 815 

AAG ATC AAC ATG GAT GAC CTC CAG CCC AAT GAA AAT GAG GAT AAG AGC 273 0 

Lys He Asn Met Asp Asp Leu Gin Pro Asn Glu Asn Glu Asp Lys Ser 
820 825 830 835 

CCC TAC CCC AAC CCA GAA ACT ACA GGA GAA GAG GAT GAG GAG GAG CCA 2778 
Pro Tyr Pro Asn Pro Glu Thr Thr Gly Glu Glu Asp Glu Glu Glu Pro 
840 845 850 
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GAG ATG CCT GTC GGC CCT CGC CCA CGA CCA CTC TCT GAG CTT CAC CTT 
Glu Met Pro Val Gly Pro Arg Pro Arg Pro Leu Ser Glu Leu His Leu 
855 860 865 

AAG GAA AAG GCA GTG CCC ATG CCA GAA GCC AGC GCG TTT TTC ATC TTC 
Lys Glu Lys Ala Val Pro Met Pro Glu Ala Ser Ala Phe Phe He Phe 
870 875 880 

AGC TCT AAC AAC AGG TTT CGC CTC CAG TGC CAC CGC ATT GTC AAT GAC 
Ser Ser Asn Asn Arg Phe Arg Leu Gin Cys His Arg He Val Asn Asp 
885 890 895 

ACG ATC TTC ACC AAC CTG ATC CTC TTC TTC ATT CTG CTC AGC AGC ATT 
Thr He Phe Thr Asn Leu He Leu Phe Phe He Leu Leu Ser Ser He 
900 90s 910 91S 

TCC CTG GCT GCT GAG GAC CCG GTC CAG CAC ACC TCC TTC AGG AAC CAT 
Ser Leu Ala Ala Glu Asp Pro Val Gin His Thr Ser Phe Arg Asn His 
920 925 930 

ATT CTG TTT TAT TTT GAT ATT GTT TTT ACC ACC ATT TTC ACC ATT GAA 
He Leu Phe Tyr Phe Asp He Val Phe Thr Thr He Phe Thr He Glu 
935 940 945 

ATT GCT CTG AAG ATG ACT GCT TAT GGG GCT TTC TTG CAC AAG GGT TCT 
He Ala Leu Lys Met Thr Ala Tyr Gly Ala Phe Leu His Lys Gly Ser 
950 955 960 

TTC TGC CGG AAC TAC TTC AAC ATC CTG GAC CTG CTG GTG GTC AGC GTG 
Phe Cys Arg Asn Tyr Phe Asn He Leu Asp Leu Leu Val Val Ser Val 
965 970 975 

TCC CTC ATC TCC TTT GGC ATC CAG TCC AGT GCA ATC AAT GTC GTG AAG 
Ser Leu He Ser Phe Gly He Gin Ser Ser Ala He Asn Val Val Lys 
980 985 990 995 

ATC TTG CGA GTC CTG CGA GTA CTC AGG CCC CTG AGG GCC ATC AAC AGG 
He Leu Arg Val Leu Arg Val Leu Arg Pro Leu Arg Ala He Asn Arg 
1000 1005 " 1010 

GCC AAG GGG CTA AAG CAT GTG GTT CAG TGT GTG TTT GTC GCC ATC CGG 
Ala Lys Gly Leu Lys His Val Val Gin Cys Val Phe Val Ala He Arg 
1015 1020 1025 

ACC ATC GGG AAC ATC GTG ATT GTC ACC ACC CTG CTG CAG TTC ATG TTT 
Thr He Gly Asn He Val He Val Thr Thr Leu Leu Gin Phe Met Phe 
1030 1035 1040 

GCC TGC ATC GGG GTC CAG CTC TTC AAG GGA AAG CTG TAC ACC TGT TCA 
Ala Cys He Gly Val Gin Leu Phe Lys Gly Lys Leu Tyr Thr Cys Ser 
1045 1050 1055 

GAC AGT TCC AAG CAG ACA GAG GCG GAA TGC AAG GGC AAC TAC ATC ACG 
Asp Ser Ser Lys Gin Thr Glu Ala Glu Cys Lys Gly Asn Tyr He Thr 
1060 1065 1070 " 1075 



WO 95/04822 



PCT/US94/09230 



-265- 



TAC AAA GAC GGG GAG GTT GAC CAC CCC ATC ATC CAA CCC CGC AGC TGG 
Tyr Lys Asp Gly Glu Val Asp His Pro lie He Gin Pro Arg Ser Trp 
1080 1085 1090 

GAG AAC AGC AAG TTT GAC TTT GAC AAT GTT CTG GCA GCC ATG ATG GCC 
Glu Asn Ser Lys Phe Asp Phe Asp Asn Val Leu Ala Ala Met Met Ala 
1095 1100 1105 

CTC TTC ACC GTC TCC ACC TTC GAA GGG TGG CCA GAG CTG CTG TAC CGC 
Leu Phe Thr Val Ser Thr Phe Glu Gly Trp Pro Glu Leu Leu Tyr Arg 
1110 1115 1120 

TCC ATC GAC TCC CAC ACG GAA GAC AAG GGC CCC ATC TAC AAC TAC CGT 
Ser He Asp Ser His Thr Glu Asp Lys Gly Pro He Tyr Asn Tyr Arg 
1125 1130 1135 

GTG GAG ATC TCC ATC TTC TTC ATC ATC TAC ATC ATC ATC ATC GCC TTC 
Val Glu He Ser He Phe Phe He He Tyr He He He He Ala Phe 
1140 1145 1150 1155 

TTC ATG ATG AAC ATC TTC GTG GGC TTC GTC ATC GTC ACC TTT CAG GAG 
Phe Met Met Asn He Phe Val Gly Phe Val He Val Thr Phe Gin Glu 
1160 1165 1170 

CAG GGG GAG CAG GAG TAC AAG AAC TGT GAG CTG GAC AAG AAC CAG CGA 
Gin Gly Glu Gin Glu Tyr Lys Asn Cys Glu Leu Asp Lys Asn Gin Arg 
1175 1180 1165 

CAG TGC GTG GAA TAC GCC CTC AAG GCC CGG CCC CTG CGG AGG TAC ATC 
Gin Cys Val Glu Tyr Ala Leu Lys Ala Arg Pro Leu Arg Arg Tyr He 
1190 1195 1200 

CCC AAG AAC CAG CAC CAG TAC AAA GTG TGG TAC GTG GTC AAC TCC ACC 
Pro Lys Asn Gin His Gin Tyr Lys Val Trp Tyr Val Val Asn Ser Thr 
1205 1210 1215 

TAC TTC GAG TAC CTG ATG TTC GTC CTC ATC CTG CTC AAC ACC ATC TGC 
Tyr Phe Glu Tyr Leu Met Phe Val Leu lie Leu Leu Asn Thr He Cys 
1220 1225 1230 1235 

CTG GCC ATG CAG CAC TAC GGC CAG AGC TGC CTG TTC AAA ATC GCC ATG 
Leu Ala Met Gin His Tyr Gly Gin Ser Cys Leu Phe Lys He Ala Met 
1240 1245 1250 

AAC ATC CTC AAC ATG CTC TTC ACT GGC CTC TTT ACC GTG GAG ATG ATC 
Asn He Leu Asn Met Leu Phe Thr Gly Leu Phe Thr Val Glu Met He 
1255 1260 1265 

CTG AAG CTC ATT GCC TTC AAA CCC AAG CAC TAT TTC TGT GAT GCA TGG 
Leu Lys Leu He Ala Phe Lys Pro Lys His Tyr Phe Cys Asp Ala Trp 
1270 1275 1280 

AAT ACA TTT GAC GCC TTG ATT GTT GTG GGT AGC ATT GTT GAT ATA GCA 
Asn Thr Phe Asp Ala Leu He Val Val Gly Ser He Val Asp He Ala 
1285 1290 1295 
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ATC ACC GAG GTA AAC CCA GCT GAA CAT ACC CAA TGC TCT CCC TCT ATG 4170 
He Thr Glu Val Asn Pro Ala Glu His Thr Gin Cys Ser Pro Ser Met 
1300 1305 1310 1315 

AAC GCA GAG GAA AAC TCC CGC ATC TCC ATC ACC TTC TTC CGC CTG TTC 4218 
Asn Ala Glu Glu Asn Ser Arg He Ser He Thr Phe Phe Arg Leu Phe 
1320 1325 1330 

CGG GTC ATG CGT CTG GTG AAG CTG CTG AGC CGT GGG GAG GGC ATC CGG 4266 
Arg Val Met Arg Leu Val Lys Leu Leu Ser Arg Gly Glu Gly He Arg 
1335 1340 1345 

ACG CTG CTG TGG ACC TTC ATC AAG TCC TTC CAG GCC CTG CCC TAT GTG 4314 
Thr Leu Leu Trp Thr Phe He Lys Ser Phe Gin Ala Leu Pro Tyr Val 
1350 1355 1360 

GCC CTC CTG ATC GTG ATG CTG TTC TTC ATC TAC GCG GTG ATC GGG ATG 4362 
Ala Leu Leu He Val Met Leu Phe Phe He Tyr Ala Val He Gly Met 
1365 1370 1375 

CAG GTG TTT GGG AAA ATT GCC CTG AAT GAT ACC ACA GAG ATC AAC CGG 4410 
Gin Val Phe Gly Lys He Ala Leu Asn Asp Thr Thr Glu He Asn Arg 
1380 1365 1390 1395 

AAC AAC AAC TTT CAG ACC TTC CCC CAG GCC GTG CTG CTC CTC TTC AGG 4458 
Asn Asn Asn Phe Gin Thr Phe Pro Gin Ala Val Leu Leu Leu Phe Arg 
1400 1405 1410 

TGT GCC ACC GGG GAG GCC TGG CAG GAC ATC ATG CTG GCC TGC ATG CCA 4 5 06 

Cys Ala Thr Gly Glu Ala Trp Gin Asp He Met Leu Ala Cys Met Pro 
1415 1420 1425 

GGC AAG AAG TGT GCC CCA GAG TCC GAG CCC AGC AAC AGC ACG GAG GGT 4 554 

Gly Lys Lys Cys Ala Pro Glu Ser Glu Pro Ser Asn Ser Thr Glu Gly 
1430 1435 1440 

GAA ACA CCC TGT GGT AGC AGC TTT GCT GTC TTC TAC TTC ATC AGC TTC 4602 
Glu Thr Pro Cys Gly Ser Ser Phe Ala Val Phe Tyr Phe He Ser Phe 
1445 1450 1455 

TAC ATG CTC TGT GCC TTC CTG ATC ATC AAC CTC TTT GTA GCT GTC ATC 465 0 

Tyr Met Leu Cys Ala Phe Leu He He Asn Leu Phe Val Ala Val He 
1460 1465 1470 1475 

ATG GAC AAC TTT GAC TAC CTG ACA AGG GAC TGG TCC ATC CTT GGT CCC 4698 
Met Asp Asn Phe Asp Tyr Leu Thr Arg Asp Trp Ser He Leu Gly Pro 
1480 1485 1490 

CAC CAC CTG GAT GAG TTT AAA AGA ATC TGG GCA GAG TAT GAC CCT GAA 4 746 

His His Leu Asp Glu Phe Lys Arg He Trp Ala Glu Tyr Asp Pro Glu 
1495 1500 ' 1505 

GCC AAG GGT CGT ATC AAA CAC CTG GAT GTG GTG ACC CTC CTC CGG CGG 4 794 

Ala Lys Gly Arg He Lys His Leu Asp Val Val Thr Leu Leu Arg Arg 
1510 1515 1520 
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ATT CAG CCG CCA CTA GGT TTT GGG AAG CTG TGC CCT CAC CGC GTG GCT 4842 
He Gin Pro Pro Leu Gly Phe Gly Lys Leu Cys Pro His Arg Val Ala 
1525 1530 1535 

TGC AAA CGC CTG GTC TCC ATG AAC ATG CCT CTG AAC AGC GAC GGG ACA 4 890 

Cys Lys Arg Leu Val Ser Met Asn Met Pro Leu Asn Ser Asp Gly Thr 
1540 1545 1550 1555 

GTC ATG TTC AAT GCC ACC CTG TTT GCC CTG GTC AGG ACG GCC CTG AGG 4 938 

Val Met Phe Asn Ala Thr Leu Phe Ala Leu Val Arg Thr Ala Leu Arg 
1560 1565 1570 

ATC AAA ACA GAA GGG AAC CTA GAA CAA GCC AAT GAG GAG CTG CGG GCG 4986 
He Lys Thr Glu Gly Asn Leu Glu Gin Ala Asn Glu Glu Leu Arg Ala 
1575 1580 1585 

ATC ATC AAG AAG ATC TGG AAG CGG ACC AGC ATG AAG CTG CTG GAC CAG 5034 
He He Lys Lys He Trp Lys Arg Thr Ser Met Lys Leu Leu Asp Gin 
1590 1595 1600 

GTG GTG CCC CCT GCA GGT GAT GAT GAG GTC ACC GTT GGC AAG TTC TAC 5082 
Val Val Pro Pro Ala Gly Asp Asp Glu Val Thr Val Gly Lys Phe Tyr 
1605 1610 1615 

GCC ACG TTC CTG ATC CAG GAG TAC TTC CGG AAG TTC AAG AAG CGC AAA 5130 
Ala Thr Phe Leu He Gin Glu Tyr Phe Arg Lys Phe Lys Lys Arg Lys 
1620 1625 1630 1635 

GAG CAG GGC CTT GTG GGC AAG CCC TCC CAG AGG AAC GCG CTG TCT CTG 517 B 

Glu Gin Gly Leu Val Gly Lys Pro Ser Gin Arg Asn Ala Leu Ser Leu 
1640 1645 1650 

CAG GCT GGC TTG CGC ACA CTG CAT GAC ATC GGG CCT GAG ATC CGA CGG 5226 
Gin Ala Gly Leu Arg Thr Leu His Asp He Gly Pro Glu He Arg Arg 
1655 1660 1665 

GCC ATC TCT GGA GAT CTC ACC GCT GAG GAG GAG CTG GAC AAG GCC ATG 5274 
Ala He Ser Gly Asp Leu Thr Ala Glu Glu Glu Leu Asp Lys Ala Met 
1670 1675 1680 

AAG GAG GCT GTG TCC GCT GCT TCT GAA GAT GAC ATC TTC AGG AGG GCC 5322 
Lys Glu Ala Val Ser Ala Ala Ser Glu Asp Asp He Phe Arg Arg Ala 
1685 1690 1695 

GGT GGC CTG TTC GGC AAC CAC GTC AGC TAC TAC CAA AGC GAC GGC CGG 537 0 

Gly Gly Leu Phe Gly Asn His Val Ser Tyr Tyr Gin Ser Asp Gly Arg 
1700 1705 1710 1715 

AGC GCC TTC CCC CAG ACC TTC ACC ACT CAG CGC CCG CTG CAC ATC AAC 5418 
Ser Ala Phe Pro Gin Thr Phe Thr Thr Gin Arg Pro Leu His He Asn 
1720 1725 1730 

AAG GCG GGC AGC AGC CAG GGC GAC ACT GAG TCG CCA TCC CAC GAG AAG 5466 
Lys Ala Gly Ser Ser Gin Gly Asp Thr Glu Ser Pro Ser His Glu Lys 
1735 1740 1745 



WO 95/04822 



PCT/US94/09230 



-268- 



CTG GTG GAC TCC ACC TTC ACC CCG AGC AGC TAC TCG TCC ACC GGC TCC 
Leu Val Asp Ser Thr Phe Thr Pro Ser Ser Tyr Ser Ser Thr Gly Ser 
1750 1755 1760 

AAC GCC AAC ATC AAC AAC GCC AAC AAC ACC GCC CTG GGT CGC CTC CCT 
Asn Ala Asn lie Asn Asn Ala Asn Asn Thr Ala Leu Gly Arg Leu Pro 
1765 1770 1775 

CGC CCC GCC GGC TAC CCC AGC ACG GTC AGC ACT GTG GAG GGC CAC GGG 
Arg Pro Ala Gly Tyr Pro Ser Thr Val Ser Thr Val Glu Gly His Gly 
17 80 1785 1790 1795 

CCC CCC TTG TCC CCT GCC ATC CGG GTG CAG GAG GTG GCG TGG AAG CTC 
Pro Pro Leu Ser Pro Ala He Arg Val Gin Glu Val Ala Trp Lys Leu 
1800 1805 1810 

AGC TCC AAC AGG TGC CAC TCC CGG GAG AGC CAG GCA GCC ATG GCG GGT 
Ser Ser Asn Arg Cys His Ser Arg Glu Ser Gin Ala Ala Met Ala Gly 
1815 1820 1825 

CAG GAG GAG ACG TCT CAG GAT GAG ACC TAT GAA GTG AAG ATG AAC CAT 
Gin Glu Glu Thr Ser Gin Asp Glu Thr Tyr Glu Val Lys Met Asn His 
1830 1835 1840 

GAC ACG GAG GCC TGC AGT GAG CCC AGC CTG CTC TCC ACA GAG ATG CTC 
Asp Thr Glu Ala Cys Ser Glu Pro Ser Leu Leu Ser Thr Glu Met Leu 
1845 1850 1855 

TCC TAC CAG GAT GAC GAA AAT CGG CAA CTG ACG CTC CCA GAG GAG GAC 
Ser Tyr Gin Asp Asp Glu Asn Arg Gin Leu Thr Leu Pro Glu Glu Asp 
I860 1865 1870 1875 

AAG AGG GAC ATC CGG CAA TCT CCG AAG AGG GGT TTC CTC CGC TCT GCC 
Lys Arg Asp He Arg Gin Ser Pro Lys Arg Gly Phe Leu Arg Ser Ala 
1880 1885 1890 

TCA CTA GGT CGA AGG GCC TCC TTC CAC CTG GAA TGT CTG AAG CGA CAG 
Ser Leu Gly Arg Arg Ala Ser Phe His Leu Glu Cys Leu Lys Arg Gin 
1895 1900 1905 

AAG GAC CGA GGG GGA GAC ATC TCT CAG AAG ACA GTC CTG CCC TTG CAT 
Lys Asp Arg Gly Gly Asp He Ser Gin Lys Thr Val Leu Pro Leu His 
1910 1915 1920 

CTG GTT CAT CAT CAG GCA TTG GCA GTG GCA GGC CTG AGC CCC CTC CTC 
Leu Val His His Gin Ala Leu Ala Val Ala Gly Leu Ser Pro Leu Leu 
1925 1930 1935 

CAG AGA AGC CAT TCC CCT GCC TCA TTC CCT AGG CCT TTT GCC ACC CCA 
Gin Arg Ser His Ser Pro Ala Ser Phe Pro Arg Pro Phe Ala Thr Pro 
1940 1945 1950 1955 

CCA GCC ACA CCT GGC AGC CGA GGC TGG CCC CCA CAG CCC GTC CCC ACC 
Pro Ala Thr Pro Gly Ser Arg Gly Trp Pro Pro Gin Pro Val Pro Thr 
I960 1965 1970 
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CTG 
Leu 


CGG 
Arg 


CTT 
Leu 


GAG GGG GTC GAG TCC 
Glu Gly Val Glu Ser 
1975 


AGT GAG 
Ser Glu 
1980 


AAA 
Lys 


CTC 
Leu 


AAC 
Asn 


AGC AGC 
Ser Ser 
1985 


Phi 


CCA 
Pro 


TCC 
Ser 


ATC CAC 
He His 
1990 


TGC GGC TCC TGG GCT 
Cys Gly Ser Trp Ala 
1995 


GAG 
Glu 


ACC 
Thr 


ACC 
Thr 


CCC GGT 
Pro Gly 
2000 


GGC GGG 
Gly Gly 


GGC 
Gly 


AGC AGC 
Ser Ser 
2005 


GCC 
Ala 


GCC CGG AGA GTC 
Ala Arg Arg Val 
2010 


CGG 
Arg 


CCC 
Pro 


GTC 
Val 


TCC CTC 
Ser Leu 
2015 


ATG 
Met 


GTG 
Val 


CCC 
Pro 


AGC CAG 
Ser Gin 
2020 


GCT 
Ala 


GGG 
Gly 


GCC CCA GGG AGG 
Ala Pro Gly Arg 
2025 


CAG 
Gin 


TTC 

Phe 


CAC GGC 
His Gly 
2030 


AGT 
Ser 


GCC 
Ala 


AGC 
Ser 


AGC 
Ser 
2035 


CTG 
Leu 


GTG 
Val 


GAA 
Glu 


GCG 
Ala 


GTC TTG ATT TCA 
Val Leu He Ser 
2040 


GAA 
Glu 


GGA CTG 
Gly Leu 
2045 


GGG 
Gly 


CAG 
Gin 


TTT 
Phe 


GCT CAA 
Ala Gin 
2050 


GAT 

Asp 


CCC 
Pro 


AAG 
Lys 


TTC ATC GAG GTC ACC 
Phe He Glu Val Thr 
2055 


ACC CAG 
Thr Gin 
2060 


GAG 
Glu 


CTG 
Leu 


GCC 
Ala 


GAC GCC 
Asp Ala 
2065 


TGC 

Cys 


GAC 
Asp 


ATG 
Met 


ACC ATA 
Thr He 
2070 


GAG GAG ATG GAG AGC 
Glu Glu Met Glu Ser 
2075 


GCG 
Ala 


GCC 
Ala 


GAC 
Asp 


AAC ATC 
Asn He 
2080 


CTC 
Leu 


AGC 
Ser 


GGG 
Gly 


GGC GCC 
Gly Ala 
2085 


CCA 
Pro 


CAG AGC CCC AAT 
Gin Ser Pro Asn 
2090 


GGC 
Gly 


GCC 
Ala 


CTC 


TTA CCC 
Leu Pro 
2095 


TTT 
Phe 


GTG 
Val 


AAC 
Asn 


TGC AGG 
Cys Arg 
2100 


GAC 
Asp 


GCG 
Ala 


GGG CAG GAC CGA 
Gly Gin Asp Arg 
2105 


GCC 
Ala 


GGG 
Gly 


GGC GAA 
Gly Glu 
2110 


GAG 
Glu 


GAC 
Asp 


GCG 
Ala 


GGC 
Gly 
2115 


TGT 

Cys 


GTG 
Val 


CGC 
Arg 


GCG 
Ala 


CGG GGT CGA CCG 
Arg Gly Arg Pro 
2120 


AGT 
Ser 


GAG GAG 
Glu Glu 
2125 


GAG 
Glu 


CTC 
Leu 


CAG 
Gin 


GAC AGC 
Asp Ser 
2130 



AGG GTC TAC GTC AGC AGC CTG TAGTGGGCGC TGCCAGATGC GGGCTTTTTT 
Arg Val Tyr Val Ser Ser Leu 
2135 

TTATTTGTTT CAATGTTCCT AATGGGTTCG TTTCAGAAGT GCCTCACTGT TCTCGT 



(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2970 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 
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(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 502.. 2316 

(D> OTHER INFORMATION: /standard_name= "Beta-2C" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37: 

CAGCAGCGTG CTAAGAAGCA GTCACATAAA CAGCAGCAGG AGTAGGCCTC CTGCTTTTCA 6 0 

AAAGCAGAGT ACTGCAGGGT CGCGAAATGC AAGACACTCA GATGTTTGAA AATCTCCCGA 120 

GTTGAGAATG GCTACTGTAA AAGCGTCACC AAGAAACTCT GACGATCTGG ACAGTCCTAA 180 

CTCTGTGTTA GCAATACTTA CTTCCGGAAA ATTAATGCTA CTTCTTGTAG ATTTTTGCAA 240 

ATAGGAAACC CCCTTGAAGA AGATCTCAAA TTACGCCCCC CACCCCCAAA AAAAGACAAA 300 

CAGGGGAGAA CAAAGTTTTG GCATGCCTGC AGGAACGGTG GCTTTTTTAG AAACTACCTA 360 

GGAGGCAGAA GCTAAGTGAT TTGCTCATGC CTCTTACCTG GGAGTAGAAG GTGGGAAGAA 420 

ATGGACCGAG GCTGTGACGA GAAGACAAGG CACAGTGCAG CTTGGTGAAG CCACACGCTG 480 

ACTGCGTTCT GCCCCCTCTT C ATG CAG TGC TGC GGG CTG GTG CAT CGC CGG 531 
Met Gin Cys Cys Gly Leu Val His Arg Arg 
15 10 

CGA GTA CGG GTG TCC TAT GGT TCG GCA GAC TCC TAC ACT AGC CGT CCA 579 
Arg Val Arg Val Ser Tyr Gly Ser Ala Asp Ser Tyr Thr Ser Arg Pro 
15 20 25 

TCC GAT TCC GAT GTA TCT CTG GAG GAG GAC CGG GAG GCA GTG CGC AGA 627 
Ser Asp Ser Asp Val Ser Leu Glu Glu Asp Arg Glu Ala Val Arg Arg 
30 35 40 

GAA GCG GAG CGG CAG GCC CAG GCA CAG TTG GAA AAA GCA AAG ACA AAG 675 
Glu Ala Glu Arg Gin Ala Gin Ala Gin Leu Glu Lys Ala Lys Thr Lys 
45 50 55 

CCC GTT GCA TTT GCG GTT CGG ACA AAT GTC AGC TAC AGT GCG GCC CAT 723 
Pro Val Ala Phe Ala Val Arg Thr Asn Val Ser Tyr Ser Ala Ala His 
60 65 70 

GAA GAT GAT GTT CCA GTG CCT GGC ATG GCC ATC TCA TTC GAA GCA AAA 771 
Glu Asp Asp Val Pro Val Pro Gly Met Ala He Ser Phe Glu Ala Lys 
75 80 85 90 

GAT TTT CTG CAT GTT AAG GAA AAA TTT AAC AAT GAC TGG TGG ATA GGG 819 
Asp Phe Leu His Val Lys Glu Lys Phe Asn Asn Asp Trp Trp He Gly 
95 100 105 

CGA TTG GTA AAA GAA GGC TGT GAA ATC GGA TTC ATT CCA AGC CCA GTC 867 
Arg Leu Val Lys Glu Gly Cys Glu He Gly Phe He Pro Ser Pro Val 
110 115 120 
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AAA CTA GAA AAC ATG AGG CTG CAG CAT GAA CAG AGA GCC AAG CAA GGG 
Lys Leu Glu Asn Met Arg Leu Gin His Glu Gin Arg Ala Lys Gin Gly 
125 130 135 

AAA TTC TAC TCC AGT AAA TCA GGA GGA AAT TCA TCA TCC AGT TTG GGT 
Lys Phe Tyr Ser Ser Lys Ser Gly Gly Asn Ser Ser Ser Ser Leu Gly 
140 145 150 

GAC ATA GTA CCT AGT TCC AGA AAA TCA ACA CCT CCA TCA TCT. GCT ATA 
Asp lie Val Pro Ser Ser Arg Lys Ser Thr Pro Pro Ser Ser Ala lie 
155 160 165 170 

GAC ATA GAT GCT ACT GGC TTA GAT GCA GAA GAA AAT GAT ATT CCA GCA 
Asp lie Asp Ala Thr Gly Leu Asp Ala Glu Glu Asn Asp lie Pro Ala 
175 180 185 

AAC CAC CGC TCC CCT AAA CCC AGT GCA AAC AGT GTA ACG TCA CCC CAC 
Asn His Arg Ser Pro Lys Pro Ser Ala Asn Ser Val Thr Ser Pro His 
190 195 200 

TCC AAA GAG AAA AGA ATG CCC TTC TTT AAG AAG ACA GAG CAC ACT CCT 
Ser Lys Glu Lys Arg Met Pro Phe Phe Lys Lys Thr Glu His Thr Pro 
205 210 215 

CCG TAT GAT GTG GTA CCT TCC ATG CGA CCA GTG GTC CTA GTG GGC CCT 
Pro Tyr Asp Val Val Pro Ser Met Arg Pro Val Val Leu Val Gly Pro 
220 225 230 

TCT CTG AAG GGC TAC GAG GTC ACA GAT ATG ATG CAA AAA GCG CTG TTT 
Ser Leu Lys Gly Tyr Glu Val Thr Asp Met Met Gin Lys Ala Leu Phe 
235 240 245 250 

GAT TTT TTA AAA CAC AGA TTT GAA GGG CGG ATA TCC ATC ACA AGG GTC 
Asp Phe Leu Lys His Arg Phe Glu Gly Arg lie Ser He Thr Arg Val 
255 260 265 

ACC GCT GAC ATC TCG CTT GCC AAA CGC TCG GTA TTA AAC AAT CCC AGT 
Thr Ala Asp He Ser Leu Ala Lys Arg Ser Val Leu Asn Asn Pro Ser 
270 275 280 

AAG CAC GCA ATA ATA GAA AGA TCC AAC ACA AGG TCA AGC TTA GCG GAA 
Lys His Ala He He Glu Arg Ser Asn Thr Arg Ser Ser Leu Ala Glu 
285 290 295 

GTT CAG AGT GAA ATC GAA AGG ATT TTT GAA CTT GCA AGA ACA TTG CAG 
Val Gin Ser Glu He Glu Arg He Phe Glu Leu Ala Arg Thr Leu Gin 
300 305 310 

TTG GTG GTC CTT GAC GCG GAT ACA ATT AAT CAT CCA GCT CAA CTC AGT 
Leu Val Val Leu Asp Ala Asp Thr He Asn His Pro Ala Gin Leu Ser 
315 320 325 330 

AAA ACC TCC TTG GCC CCT ATT ATA GTA TAT GTA AAG ATT TCT TCT CCT 
Lys Thr Ser Leu Ala Pro He He Val Tyr Val Lys He Ser Ser Pro 
335 340 345 
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AAG GTT TTA CAA AGG TTA ATA AAA TCT CGA GGG AAA TCT CAA GCT AAA 158 7 

Lys Val Leu Gin Arg Leu He Lys Ser Arg Gly Lys Ser Gin Ala Lys 
350 355 360 

CAC CTC AAC GTC CAG ATG GTA GCA GCT GAT AAA CTG GCT CAG TGT CCT 1635 
His Leu Asn Val Gin Met Val Ala Ala Asp Lys Leu Ala Gin Cys Pro 
365 370 " 375 

CCA GAG CTG TTC GAT GTG ATC TTG GAT GAG AAC CAG CTT GAG GAT GCC 1683 
Pro Glu Leu Phe Asp Val He Leu Asp Glu Asn Gin Leu Glu Asp Ala 
380 385 390 

TGT GAG CAC CTT GCC GAC TAT CTG GAG GCC TAC TGG AAG GCC ACC CAT 1731 
Cys Glu His Leu Ala Asp Tyr Leu Glu Ala Tyr Trp Lys Ala Thr His 
395 400 405 410 

CCT CCC AGC AGT AGC CTC CCC AAC CCT CTC CTT AGC CGT ACA TTA GCC 1779 
Pro Pro Ser Ser Ser Leu Pro Asn Pro Leu Leu Ser Arg Thr Leu Ala 
415 420 425 

ACT TCA AGT CTG CCT CTT AGC CCC ACC CTA GCC TCT AAT TCA CAG GGT 1827 
Thr Ser Ser Leu Pro Leu Ser Pro Thr Leu Ala Ser Asn Ser Gin Gly 
430 435 440 

TCT CAA GGT GAT CAG AGG ACT GAT CGC TCC GCT CCT ATC CGT TCT GCT 1875 
Ser Gin Gly Asp Gin Arg Thr Asp Arg Ser Ala Pro He Arg Ser Ala 
445 450 455 

TCC CAA GCT GAA GAA GAA CCT AGT GTG GAA CCA GTC AAG AAA TCC CAG 1923 
Ser Gin Ala Glu Glu Glu Pro Ser Val Glu Pro Val Lys Lys Ser Gin 
460 465 470 

CAC CGC TCT TCC TCC TCA GCC CCA CAC CAC AAC CAT CGC AGT GGG ACA 1971 
His Arg Ser Ser Ser Ser Ala Pro His His Asn His Arg Ser Gly Thr 
475 480 485 490 

AGT CGC GGC CTC TCC AGG CAA GAG ACA TTT GAC TCG GAA ACC CAG GAG 2019 
Ser Arg Gly Leu Ser Arg Gin Glu Thr Phe Asp Ser Glu Thr Gin Glu 
495 500 505 

AGT CGA GAC TCT GCC TAC GTA GAG CCA AAG GAA GAT TAT TCC CAT GAC 2067 
Ser Arg Asp Ser Ala Tyr Val Glu Pro Lys Glu Asp Tyr Ser His Asp 
510 515 520 

CAC GTG GAC CAC TAT GCC TCA CAC CGT GAC CAC AAC CAC AGA GAC GAG 2115 
His Val Asp His Tyr Ala Ser His Arg Asp His Asn His Arg Asp Glu 
525 530 535 

ACC CAC GGG AGC AGT GAC CAC AGA CAC AGG GAG TCC CGG CAC CGT TCC 2163 
Thr His Gly Ser Ser Asp His Arg His Arg Glu Ser Arg His Arg Ser 
540 545 550 

CGG GAC GTG GAT CGA GAG CAG GAC CAC AAC GAG TGC AAC AAG CAG CGC 2211 
Arg Asp Val Asp Arg Glu Gin Asp His Asn Glu Cys Asn Lys Gin Arg 
555 560 565 570 
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AGC CGT CAT AAA TCC AAG GAT CGC TAC TGT GAA AAG GAT GGA GAA GTG 2259 
Ser Arg His Lys Ser Lys Asp Arg Tyr Cys Glu Lys Asp Gly Glu Val 
575 580 585 

ATA TCA AAA AAA CGG AAT GAG GCT GGG GAG TGG AAC AGG GAT GTT TAC 23 0 7 

lie Ser Lys Lys Arg Asn Glu Ala Gly Glu Trp Asn Arg Asp Val Tyr 
590 595 600 

ATC CCC CAA TGAGTTTTGC CCTTTTGTGT TTTTTTTTTT TTTTTTTTGA 2356 
He Pro Gin 
605 

AGTCTTGTAT AACTAACAGC ATCCCCAAAA CAAAAAGTCT TTGGGGTCTA CACTGCAATC 2416 

ATATGTGATC TGTCTTGTAA TATTTTGTAT TATTGCTGTT GCTTGAATAG CAATAGCATG 2476 

GATAGAGTAT TGAGATACTT TTTCTTTTGT AAGTGCTACA TAAATTGGCC TGGTATGGCT 2536 

GCAGTCCTCC GGTTGCATAC TGGACTCTTC AAAAACTGTT TTGGGTAGCT GCCACTTGAA 25 96 

CAAAATCTGT TGCCACCCAG GTGATGTTAG TGTTTTAAGA AATGTAGTTG ATGTATCCAA 2656 

CAAGCCAGAA TCAGCACAGA TAAAAAGTGG AATTTCTTGT TTCTCCAGAT TTTTAATACG 2716 

TTAATACGCA GGCATCTGAT TTGCATATTC ATTCATGGAC CACTGTTTCT TGCTTGTACC 2776 

TCTGGCTGAC TAAATTTGGG GACAGATTCA GTCTTGCCTT ACACAAAGGG GATCATAAAG 2836 

TTAGAATCTA TTTTCTATGT ACTAGTACTG TGTACTGTAT AGACAGTTTG TAAATGTTAT 2896 

TTCTGCAAAC AAACACCTCC TTATTATATA TAATATATAT ATATATATCA GTTTGATCAC 2956 

ACTATTTTAG AGTC 2970 
(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2712 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

<B) LOCATION: 223.. 2061 

(D) OTHER INFORMATION: /standard_name= "Beta-2E" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:38: 
AGTGTGTGTT TTCAGCCCCT CCTGGAATGG GAAAATAAGA ATCTCCCTGG ATGGGAGTCC 60 
TCTGGGGCAG GGAGTGAAAG CCCCGGAGGC AGAAAGGGAC GGAGAACAGG GGCTTGCCCA 120 
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GAGCATGGAT AGGAAAGGAG CTGGGGTTCT CCGGGGCTCA GCGCGCACTG AGAACCTGTG 180 

CCCGGGGCTG CAGCTGCGGA CGATAAAGGC GCTGTCTGGC TC ATG AAG GCC ACC 234 

Met Lys Ala Thr 
1 

TGG ATC AGG CTT CTG AAA AGA GCC AAG GGA GGA AGG CTG AAG AAT TCT 282 
Trp lie Arg Leu Leu Lys Arg Ala Lys Gly Gly Arg Leu Lys Asn Ser 
5 10 15 20 

GAT ATC TGT GGT TCG GCA GAC TCC TAC ACT AGC CGT CCA TCC GAT TCC 33 0 

Asp lie Cys Gly Ser Ala Asp Ser Tyr Thr Ser Arg Pro Ser Asp Ser 
25 30 35 

GAT GTA TCT CTG GAG GAG GAC CGG GAG GCA GTG CGC AGA GAA GCG GAG 378 
Asp Val Ser Leu Glu Glu Asp Arg Glu Ala Val Arg Arg Glu Ala Glu 
40 45 50 

CGG CAG GCC CAG GCA CAG TTG GAA AAA GCA AAG ACA AAG CCC GTT GCA 426 
Arg Gin Ala Gin Ala Gin Leu Glu Lys Ala Lys Thr Lys Pro Val Ala 
55 60 65 

TTT GCG GTT CGG ACA AAT GTC AGC TAC AGT GCG GCC CAT GAA GAT GAT 474 
Phe Ala Val Arg Thr Asn Val Ser Tyr Ser Ala Ala His Glu Asp Asp 
70 75 80 

GTT CCA GTG CCT GGC ATG GCC ATC TCA TTC GAA GCA AAA GAT TTT CTG 522 
Val Pro Val Pro Gly Met Ala He Ser Phe Glu Ala Lys Asp Phe Leu 
85 90 95 100 

CAT GTT AAG GAA AAA TTT AAC AAT GAC TGG TGG ATA GGG CGA TTG GTA 570 
His Val Lys Glu Lys Phe Asn Asn Asp Trp Trp He Gly Arg Leu Val 
105 no 115 

AAA GAA GGC TGT GAA ATC GGA TTC ATT CCA AGC CCA GTC AAA CTA GAA 618 
Lys Glu Gly Cys Glu He Gly Phe He Pro Ser Pro Val Lys Leu Glu 
120 125 130 

AAC ATG AGG CTG CAG CAT GAA CAG AGA GCC AAG CAA GGG AAA TTC TAC 666 
Asn Met Arg Leu Gin His Glu Gin Arg Ala Lys Gin Gly Lys Phe Tyr 
135 140 145 

TCC AGT AAA TCA GGA GGA AAT TCA TCA TCC AGT TTG GGT GAC ATA GTA 714 
Ser Ser Lys Ser Gly Gly Asn Ser Ser Ser Ser Leu Gly Asp He Val 
150 155 160 

CCT AGT TCC AGA AAA TCA ACA CCT CCA TCA TCT GCT ATA GAC ATA GAT 762 
Pro Ser Ser Arg Lys Ser Thr Pro Pro Ser Ser Ala He Asp He Asp 
165 170 175 180 

GCT ACT GGC TTA GAT GCA GAA GAA AAT GAT ATT CCA GCA AAC CAC CGC 810 
Ala Thr Gly Leu Asp Ala Glu Glu Asn Asp He Pro Ala Asn His Arg 
185 190 195 

TCC CCT AAA CCC AGT GCA AAC AGT GTA ACG TCA CCC CAC TCC AAA GAG 858 
Ser Pro Lys Pro Ser Ala Asn Ser Val Thr Ser Pro His Ser Lys Glu 
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AAA AGA ATG CCC TTC TTT AAG AAG ACA GAG CAC ACT CCT CCG TAT GAT 
Lys Arg Met Pro Phe Phe Lys Lys Thr Glu His Thr Pro Pro Tyr Asp 
215 220 225 

GTG GTA CCT TCC ATG CGA CCA GTG GTC CTA GTG GGC CCT TCT CTG AAG 
Val Val Pro Ser Met Arg Pro Val Val Leu Val Gly Pro Ser Leu Lys 
230 235 240 

GGC TAC GAG GTC ACA GAT ATG ATG CAA AAA GCG CTG TTT GAT TTT TTA 
Gly Tyr Glu Val Thr Asp Met Met Gin Lys Ala Leu Phe Asp Phe Leu 
245 250 255 260 

AAA CAC AGA TTT GAA GGG CGG ATA TCC ATC ACA AGG GTC ACC GCT GAC 
Lys His Arg Phe Glu Gly Arg He Ser He Thr Arg Val Thr Ala Asp 
265 270 275 

ATC TCG CTT GCC AAA CGC TCG GTA TTA AAC AAT CCC AGT AAG CAC GCA 
He Ser Leu Ala Lys Arg Ser Val Leu Asn Asn Pro Ser Lys His Ala 
280 285 290 

ATA ATA GAA AGA TCC AAC ACA AGG TCA AGC TTA GCG GAA GTT CAG AGT 
He He Glu Arg Ser Asn Thr Arg Ser Ser Leu Ala Glu Val Gin Ser 
295 300 305 

GAA ATC GAA AGG ATT TTT GAA CTT GCA AGA ACA TTG CAG TTG GTG GTC 
Glu He Glu Arg He Phe Glu Leu Ala Arg Thr Leu Gin Leu Val Val 
310 " 315 320 

CTT GAC GCG GAT ACA ATT AAT CAT CCA GCT CAA CTC AGT AAA ACC TCC 
Leu Asp Ala Asp Thr He Asn His Pro Ala Gin Leu Ser Lys Thr Ser 
325 330 335 340 

TTG GCC CCT ATT ATA GTA TAT GTA AAG ATT TCT TCT CCT AAG GTT TTA 
Leu Ala Pro He He Val Tyr Val Lys He Ser Ser Pro Lys Val Leu 
345 ~ 350 355 

CAA AGG TTA ATA AAA TCT CGA GGG AAA TCT CAA GCT AAA CAC CTC AAC 
Gin Arg Leu He Lys Ser Arg Gly Lys Ser Gin Ala Lys His Leu Asn 
360 365 370 

GTC CAG ATG GTA GCA GCT GAT AAA CTG GCT CAG TGT CCT CCA GAG CTG 
Val Gin Met Val Ala Ala Asp Lys Leu Ala Gin Cys Pro Pro Glu Leu 
375 380 385 

TTC GAT GTG ATC TTG GAT GAG AAC CAG CTT GAG GAT GCC TGT GAG CAC 
Phe Asp Val He Leu Asp Glu Asn Gin Leu Glu Asp Ala Cys Glu His 
390 395 400 

CTT GCC GAC TAT CTG GAG GCC TAC TGG AAG GCC ACC CAT CCT CCC AGC 
Leu Ala Asp Tyr Leu Glu Ala Tyr Trp Lys Ala Thr His Pro Pro Ser 
405 410 415 420 

AGT AGC CTC CCC AAC CCT CTC CTT AGC CGT ACA TTA GCC ACT TCA AGT 
Ser Ser Leu Pro Asn Pro Leu Leu Ser Arg Thr Leu Ala Thr Ser Ser 
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CTG CCT CTT AGC CCC ACC CTA GCC TCT AAT TCA CAG GGT TCT CAA GGT 
Leu Pro Leu Ser Pro Thr Leu Ala Ser Asn Ser Gin Gly Ser Gin Gly 
440 445 450 

GAT CAG AGG ACT GAT CGC TCC GCT CCT ATC CGT TCT GCT TCC CAA GCT 
Asp Gin Arg Thr Asp Arg Ser Ala Pro He Arg Ser Ala Ser Gin Ala 
455 460 465 

GAA GAA GAA CCT AGT GTG GAA CCA GTC AAG AAA TCC CAG CAC CGC TCT 
Glu Glu Glu Pro Ser Val Glu Pro Val Lys Lys Ser Gin His Arq Ser 
470 475 480 - 

TCC TCC TCA GCC CCA CAC CAC AAC CAT CGC AGT GGG ACA AGT CGC GGC 
Ser Ser Ser Ala Pro His His Asn His Arg Ser Gly Thr Ser Arg Gly 
485 490 495 S 500 

CTC TCC AGG CAA GAG ACA TTT GAC TCG GAA ACC CAG GAG AGT CGA GAC 
Leu Ser Arg Gin Glu Thr Phe Asp Ser Glu Thr Gin Glu Ser Arg Asp 
505 510 515 

TCT GCC TAC GTA GAG CCA AAG GAA GAT TAT TCC CAT GAC CAC GTG GAC 
Ser Ala Tyr Val Glu Pro Lys Glu Asp Tyr Ser His Asp His Val Asp 
520 525 530 

CAC TAT GCC TCA CAC CGT GAC CAC AAC CAC AGA GAC GAG ACC CAC GGG 
His Tyr Ala Ser His Arg Asp His Asn His Arg Asp Glu Thr His Gly 
535 540 545 

AGC AGT GAC CAC AGA CAC AGG GAG TCC CGG CAC CGT TCC CGG GAC GTG 
Ser Ser Asp His Arg His Arg Glu Ser Arg His Arg Ser Arg Asp Val 
550 555 560 

GAT CGA GAG CAG GAC CAC AAC GAG TGC AAC AAG CAG CGC AGC CGT CAT 
Asp Arg Glu Gin Asp His Asn Glu Cys Asn Lys Gin Arg Ser Arg His 
565 570 575 580 

AAA TCC AAG GAT CGC TAC TGT GAA AAG GAT GGA GAA GTG ATA TCA AAA 
Lys Ser Lys Asp Arg Tyr Cys Glu Lys Asp Gly Glu Val He Ser Lys 
585 590 595 

AAA CGG AAT GAG GCT GGG GAG TGG AAC AGG GAT GTT TAC ATC CCC CAA 
Lys Arg Asn Glu Ala Gly Glu Trp Asn Arg Asp Val Tyr He Pro Gin 
600 605 610 

TGAGTTTTGC CCTTTTGTGT TTTTTTTTTT TTTTTTTTGA AGTCTTGTAT AACTAACAGC 

ATCCCCAAAA CAAAAAGTCT TTGGGGTCTA CACTGCAATC ATATGTGATC TGTCTTGTAA 217 8 

TATTTTGTAT TATTGCTGTT GCTTGAATAG CAATAGCATG GATAGAGTAT TGAGATACTT 223 8 

TTTCTTTTGT AAGTGCTACA TAAATTGGCC TGGTATGGCT GCAGTCCTCC GGTTGCATAC 2298 

TGGACTCTTC AAAAACTGTT TTGGGTAGCT GCCACTTGAA CAAAATCTGT TGCCACCCAG 23 58 
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GTGATGTTAG TGTTTTAAGA AATGTAGTTG ATGTATCCAA CAAGCCAGAA TCAGCACAGA 2418 

TAAAAAGTGG AATTTCTTGT TTCTCCAGAT TTTTAATACG TTAATACGCA GGCATCTGAT 2478 

TTGCATATTC ATTCATGGAC CACTGTTTCT TGCTTGTACC TCTGGCTGAC TAAATTTGGG 2538 

GACAGATTCA GTCTTGCCTT ACACAAAGGG GATCATAAAG TTAGAATCTA TTTTCTATGT 2598 

ACTAGTACTG TGTACTGTAT AGACAGTTTG TAAATGTTAT TTCTGCAAAC AAACACCTCC 2658 

TTATTATATA TAATATATAT ATATATATCA GTTTGATCAC ACTATTTTAG AGTC 2712 
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WHAT IS CLAIMED IS: 

1. An isolated DNA fragment, comprising a sequence of 
nucleotides that encodes an or t subunit selected from the group 
consisting of 0f 1A _ l# a^.j, a 1B . lt a ic _ 2 and a 1E _ 3 . 
5 2. The DNA fragment of claim l, wherein the a a subunit 

is a lh . x or a^.,. 

3. The DNA fragment of claim 1, wherein the a x subunit 
is or a 1E . 3 

4. The DNA fragment of claim l, wherein the a x subunit 
10 is a lc _ 2 . 

5. An isolated DNA fragment, comprising a sequence of 
nucleotides that encodes a £ subunit selected from the group 
consisting of (3 2 , /3 3 and /3 4 . 

6. The DNA fragment of claim 5, wherein the subunit is 
15 a /3 2c , 0 2D or fi„ subunit. 

7. The DNA fragment of claim 5, wherein the subunit is 
a /3 3 subunit. 

8. The DNA fragment of claim 7, wherein the subunit is 
a subunit. 

20 9- The DNA fragment of claim 5, wherein the subunit is 

a /S 4 subunit. 

10. The DNA fragment of claim 9, wherein the subunit has 
an amino acid sequence set forth in SEQ ID No. 28. 

11. A eukaryotic cell, comprising heterologous DNA that 
25 encodes an ar 3 subunit selected from the group of subunit s 

consisting of Qf 1A . a , a^j, ot lc . 3 r a 1E .j, and a 1E _ 3 . 

12. A eukaryotic cell, comprising heterologous DNA that 
encodes an subunit and heterologous DNA that encodes a /3 
subunit, wherein at least one subunit is selected from the 

30 group of subunits consisting of o 1A . lf a 1A _ 2 , a ic . 2 , ct XE . lt a is _ 3 , /3 2C , 
02V /S 2E , /S 3 . 1# a /3« subunit. 

13. The eukaryotic cell of claim 12, wherein the 0 
subunit is a /3 2 subunit. 

14. The eukaryotic cell of claim 12, wherein the (3 
35 subunit is a /3 4 subunit. 
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15. The eukaryotic cell of claim 11, selected from the 
group consisting of HEK 293 cells, Chinese hamster ovary 
cells, African green monkey cells, and mouse L cells. 

16 . The eukaryotic cell of claim 12 selected from the 
5 group consisting of HEK 293 cells, Chinese hamster ovary 

cells, African green monkey cells, and mouse L cells. 

17. A eukaryotic cell with a functional, heterologous 
calcium channel, produced by a process comprising: 

introducing into the cell heterologous nucleic acid that 
10 encodes an o^-subunit of a human calcium channel, wherein: 

the cKj subunit is selected from the group consisting of 
<*ia-i/ <* 1A _ 2 , a ic . 2 , and a 1E . 3 ; 

the heterologous calcium channel contains at least one 
subunit encoded by the heterologous nucleic acid; and 
15 the only heterologous ion channels are calcium channels. 

18. A eukaryotic cell with a functional, heterologous 
calcium channel, produced by a process comprising: 

introducing into the cell nucleic acid that encodes an 
a x subunit of a human calcium channel and introducing into the 
20 cell nucleic acid that encodes a /3 subunit of a human calcium 
channel , wherein : 

at least one of the subunits is s elected from the group 
consisting of a^^, or^^, a lz . lt ot lB . 3 , P 2 o Pzv P 2 b> a P 3 and a /?< 
subunit ; 

25 the heterologous calcium channel contains at least one 

subunit encoded by the heterologous nucleic acid; and 

the only heterologous ion channels are calcium channels. 

19. The eukaryotic cell of claim 17 selected from the 
group consisting of HEK 293 cells, Chinese hamster ovary 

3 0 cells, African green monkey cells, mouse L cells and amphibian 
oocytes . 

20. The eukaryotic cell of claim 18 selected from the 
group consisting of HEK 293 cells, Chinese hamster ovary 
cells, African green monkey cells, mouse L cells and amphibian 

3 5 oocytes. 
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21. The eukaryotic cell of claim 18, wherein the P 
subunit is a /3 2 , /3 3 or /3, subunit of a human calcium channel. 

22. The eukaryotic cell of claim 18, wherein the calcium 
channel includes an or 2b subunit of a human calcium channel, an 

5 a^.j subunit of a human calcium channel and a 0 3 subunit of a 
human calcium channel. 

23. The eukaryotic cell of claim 18, wherein the calcium 
channel includes an or 1B1 ' <*2b» and a (i x _ 2 subunit, or an a 1B . lt 
or 2b> and a /3j.. 3 subunit, or an a 1B _ 2 , a 2b , and a jS^j subunit, or 

10 an a 1A _ 2 , or 2b/ and a /3 3 .., subunit, or a at 1B _ lf a 2b , and an 
subunit . 

24. The eukaryotic cell of claim IB, wherein the 
calcium channel contains an a 2b subunit of a human calcium 
channel, an a 1B or an a 1D subunit of a human calcium channel and 

15 a Pj.j, /3,_ 2 or /5j.j subunit of a human calcium channel. 

25 . A method for identifying a compound that modulates 
the activity of a calcium channel, comprising; 

suspending a eukaryotic cell that has a functional, 
heterologous calcium channel, in a solution containing the 
20 compound and a calcium channel -selective ion: 

depolarizing the cell membrane of the cell; and 
detecting the current flowing into the cell, 
wherein: 

the heterologous calcium channel includes at least one 
25 human calcium channel subunit encoded by DNA or RNA that is 

heterologous to the cell,- 

at least one subunit is selected from the group 

consisting of otn-n £*ia-2. «ie-i» a iE-3* a ic-2> Pia /^d/ /3 2e » a Pi 

subunit and a /3 4 subunit; 
3 0 the current that is detected is different from that 

produced by depolarizing the same or a substantially identical 

cell in the presence of the same calcium channel selective ion 

but in the absence of the compound. 

26. The method of claim 25, wherein the heterologous DNA 
3 5 or RNA encodes a /S 3 subunit. 
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27. The method of claim 26, wherein the heterologous DNA 
or RNA encodes a 0 4 subunit. 

28. A subunit -specific antibody selected from the group 
consisting of antibodies that bind to an or subunit type or a 

5 subunit subtype of a human calcium channels, wherein the 
subunit is an a x subunit . 

29. The antibody of claim 28, wherein antibody is 
subtype specific and the a y subunit is c* 1A , a 1E and ck 1b . 

30. An RNA or single- stranded DNA probe of at least 16 
10 bases in length comprising at least 16 substantially 

contiguous bases from nucleic acids that encode a subunit of 
a human calcium channel selected from the group of subunits 
consisting of Qf^.j, a 1E . lt a ic . 2 , a 1£ _ 3 , P 3 . lt /3 2C , £ 2D , 0 2E and 

15 31. The probe of claim 30 that contains at least 30 

bases that are from nucleic acids that encode a subunit of a 
human calcium channel selected from the group of subunits 
consisting of a 1A .i, or^, <*ie-i/ Qic-j. <* 1E _ 3 , 03-i> Pza 02d» 0 2 e and 
/3« subunits . 

20 32. A method for identifying nucleic acids that encode 

a human calcium channel subunit, comprising hybridizing under 
conditions of at least low stringency a probe of claim 3 0 to 
a library of nucleic acid fragments, and selecting hybridizing 
fragments . . 

25 33 . A method for identifying cells or tissues that 

express a calcium channel subunit -encoding nucleic acid, 
comprising hybridizing under conditions of at least low 
stringency a probe of claim 3 0 with mRNA expressed in the 
cells or tissues or cDNA produced from the mRNA, and thereby 

30 identifying cells or tissue that express mRNA that encodes the 
subunit . 

34. A substantially pure human calcium channel subunit 
selected from the group consisting of a 1A . 1( a 1A . 2 , a 1E . lt a ic _ 2 , a 1E . 
3. 03-1. 02c /3 2E and /3« . 
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